1 AAGCGATAGC TGAGTGCGGC GGCTGCTGAT TGTGTTCTAG GGGACGGAGT 

51 AGGGGAAGAC GTTTGCTCTC CCGGAACAGC CTATCTCATT CCTTTCTTTC 

101 GATTACCCGT GGCGCGGAGA GTCAGGGCGG CGGCTGCGGC AGCAAGGGCG 

151 GCGGTGGCGG CGGCGGCAGC TGCAGTGACA TGTCCAGCAT GAATCCCGAA 

201 TATGATTATT TATTCAAGTT ACTTCTGATT GGCGACTCAG GGGTTGGAAA 

251 GTCTTGCCTT CTTCTTAGGT TTGCAGATGA TACATATACA GAAAGCTACA 

301 TCAGCACAAT TGGTGTGGAT TTCAAAATAA GAACTATAGA GTTAGACGGG 

351 AAAACAATCA AGCTTCAAAT AGAGTCCTTC AATAATGTTA AACAGTGGCT 

401 GCAGGAAATA GATCGTTATG CCAGTGAAAA TGTCAACAAA TTGTTGGTAG 

451 GGAACAAATG TGATCTGACC ACAAAGAAAG TAGTAGACTA CACAACAGCG 

501 AAGGAATTTG CTGATTCCCT TGGAATTCCG TTTTTGGAAA CCAGTGCTAA 

551 GAATGCAACG AATGTAGAAC AGTCTTTCAT GACGATGGCA GCTGAGATTA 

601 AAAAGCGAAT GGGTCCCGGA GCAACAGCTG GTGGTGCTGA GAAGT CCAAT 

651 GTTAAAATTC AGAGCACTCC AGTCAAGCAG TCAGGTGGAG GTTGCTGCTA 

7 01 AAATTTGCCT CCATCCTTTT CTCACAGCAA TGAATTTGCA ATCTGAACCC 

751 AAGTGAAAAA ACAAAATTGC CTGAATTGTA CTGTATGTAG CTGCACTACA 

801 ACAGATTCTT ACCGTCTCCA CAAAGGTCAG AGATTGTAAA TGGTCAATAC 

851 TGACTTTTTT TTTATTCCCT TGACTCAAGA CAGCTAACTT CATTTTCAGA 

901 ACTGTTTTAA ACCTTTGTGT GCTGGTTTAT AAAATAATGT GTGTAATCCT 

951 TGTTGCTTTC CTGATACCAG ACTGTTTCCC GTGGTTGGTT AGAATATATT 

1001 TTGTTTTGAT GTTTATATTG GCATGTTTAG ATGTCAGGTT TAGTCTTCTG 

1051 AAGATGAAGT TCAGCCATTT TGTATCAAAC AGCACAAGCA GTGTCTGTCA 

1101 CTTTCCATGC ATAAAGTTTA GTGAGATGTT ATATGTAAGA TCTGATTTGC 

1151 TAGTTCTTCC TTGTAGAGTT ATAAATGGAA AGATTACACT ATCTGATTAA 

1201 TAGTTTCTTC ATACTCTGCA TATAATTTGT GGCTGCAGAA TATTGTAATT 

1251 TGTTGCACAC TATGTAACAA AACAACTGAA GATATGTTTA ATAAATATTG 

1301 TACT TAT TGG AAGTAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

1351 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

1401 AAAAA (SEQ ID NO:l) 



FEATURES : 

5 T UTR : 1-179 

Start Codon: 180 

Stop Cocion: 69 9 

3 T UTR : 702 



Homologous proteins : 

Top 10 BLAST Hits 



CRA1 10800002 4 647144 
CRA|18000004923424 
CRA (18000004 93 7 406 
CRA] 18000004952860 
CRAI18000004995539 
CRAI18000004967528 
CRAj 18000004880958 
CRA[ 18000004908714 
CRAI18000005175724 
CRA1335001098696672 



/altid=gi 
/altid=gi j 
/altid=gi I 
/altid=gi i 
/altid-gi | 
/altid=gi| 
/altid=gi| 
/ altid-gi I 
/altid=gi I 

/altid=gi 



112728868 /def=ref I XP_002675 . 2 j RA. 
4758988 /def=ref 1 NP_004152 . 1 | RAB1 . 
131787 /def=spjP05711|RBlA_RAT RAS . 
131785 /def=sp|P22125|RABl_DISOM R. 
103720 /def-pir| ID38625 GTP-bindin. 
92339 /def^pir j | S06147 GTP-binding. 
464524 / def =sp j Q0597 4 | RAB1__LYMST R. 
466171 /def=spiP33723 lYPTlJffiUCR G. 
7497231 /def-pir NT33781 hypotheti. 
111558649 /def=emb|CAC17833.1 i (AJ. 



Score 


E 


372 


e-102 


332 


5e-90 


328 


le-88 


320 


3e-86 


313 


3e-84 


297 


2e-79 


282 


9e-75 


253 


3e-66 


253 


4e-66 


251 


2e-65 
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BLAST dbEST hits: 









Score 




E 


gi | 12867866 


/dataset^dbest 


/taxon=960 . . . 


654 


0. 


0 


gi | xzuy / o^u 


/ dataset=dbest 


/taxon=96. . . 


654 


0. 


0 


gi(12793758 


/dataset=dbest 


/taxon-960. . . 


624 


e- 


-177 


gi I 12338056 


/dataset=dbest 


/ taxon=96 . . . 


622 


e- 


176 


gi|11977068 


/dataset^dbest 


/taxon=96 . . . 


609 


e- 


-172 


gi j 10339840 


/dataset=dbest 


/taxon-960 . . . 


517 


e- 


-145 


gi|10349761 


/dataset=dbest 


/taxon=960. . . 


436 


e- 


-120 


gi|10997958 


/dataset=dbest 


/taxon-96 . . . 


385 


e- 


105 


gi|10996533 


/dataset=dbest 


/ taxon=96 . . . 


381 


e- 


103 



EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 
From BLAST dbEST hits: 
gi 1 12867866 Fetal brain 
gi 1 12097820 Adrenal gland 

gi 112793758 Brain neoroblastoma cell line 

gi 112338056 Adrenal gland 

gi 1 11977068 Skin melanotic melanoma 

gi 1 10339840 Uterus leiomyosarcoma 

gi 1 10349761 Skin melanotic melanoma 

gi i 10997958 Placenta 

gi i 10996533 Placenta 

From tissue screening panels: 
Whole brain 
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1 MSSMNPEYDY LFKLLLIGDS GVGKSCLLLR FADDTYTESY ISTIGVDFKI 

51 RTIELDGKTI KLQIESFNNV KQWLQEIDRY ASENVNKLLV GNKCDLTTKK 

101 VVDYTTAKEF ADSLGIPFLE TSAKNATNVE QSFMTMAAEI KKRMGPGATA 

151 GGAEKSNVKI QSTPVKQSGG GCC (SEQ ID NO: 2) 



FEATURES : 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION 
N-glycosylation site 

125-128 NATN 

[2] PDOC00005 PS00005 PKC__PHOSPHO_SITE 
Protein kinase C phosphorylation site 

Number of matches: 5 

1 59-61 TIK 

2 97-99 TTK 

3 98-100 TKK 

4 106-108 TAK 

5 122-124 SAK 

[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 

Number of matches: 3 

1 35-38 TYTE 

2 106-109 TAKE 

3 127-130 TNVE 

[4] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

30-36 RFADDTY 

[5] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 3 

1 21-2 6 GVGKSC 

2 147-152 GATAGG 

3 152-157 GAEKSN 

[6] PDOC00017 PS00017 ATP_GTP_A 
ATP/GTP-binding site motif A (P-loop) 

18-25 GDSGVGKS 

[7] PDOC00579 PS00675 SIGMA54_JNTERACT_1 

Sigma-54 interaction domain ATP-binding region A signature 
14-27 LLLIGDSGVGKSCL 
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BLAST Alignment to Top Hit: 

>CRA|108000024647144 /altid=gi 1 12728868 /def=ref I XP_002675 . 2 | RAB1, 
member RAS oncogene family [Homo sapiens] /org^Homo 
sapiens /taxon=9606 /dataset-nraa /length=222 
Length = 222 

Score = 372 bits (944), Expect - e-102 

Identities = 190/222 (85%), Positives = 190/222 (85%), Gaps - 32/222 (14%) 
Frame - +3 

Query: 129 GGCGSKGGGGGGGSCSDMSSMNPEYDYLFKLLLIGDSGVGKSCLLLRFADDTYTESYIST 308 

GGCGSKGGGGGGGSCSDMSSMNPEYDYLFKLLLIGDSGVGKSCLLLRFADDTYTESYIST 
Sbjct: 1 GGCGSKGGGGGGGSCSDMSSMN PEYDYLFKLLLIGDSGVGKSCLLLRFADDTYTESYIST 60 

Query: 30 9 IGVDFKIRTIELDGKTIKLQI ESFNNVK 3 92 

IGVDFKIRTIELDGKTIKLQI ESFNNVK 
Sbjct : 61 IGVDFKIRTIELDGKTIKLQIWDTAGQERFRTITSSYYRGAHGIIVVYDVTDQESFNNVK 120 

Query: 393 QWLQEIDRYASENVNKLLVGNKCDLTTKKVVDYTTAKEFADSLGIPFLETSAKNATNVEQ 572 

QWLQEIDRYASENVNKLLVGNKCDLTTKKWDYTTAKEFADSLGIPFLETSAKNATNVEQ 
Sbjct: 121 QWLQEIDRYASENVNKLLVGNKCDLTTKKWDYTTAKEFADSLGIPFLETSAKNATNVEQ 180 

Query: 573 S FMTMAAE IKKRMGPGATAGGAEKSNVKI QST PVKQSGGGCC 698 

SFMTMAAEIKKRMGPGATAGGAEKSNVKIQSTPVKQSGGGCC 
Sbjct: 181 S FMTMAAE IKKRMGPGATAGGAEKSNVKI QST PVKQSGGGCC 222 (SEQ ID NO: 4) 



Hmmer search results (Pfam) 

Model Description 



PF00071 Ras family 

CE00060 CE00060 rab_ras_like 

PF00634 BRCA2 repeat. 

PF00056 lactate/malate dehydrogenase 



Score E-value N 

256.4 7.7e-75 2 

170.0 3.9e-47 2 

9.9 0.39 1 

3.9 3.4 1 



Parsed for domains : 



Model 


Domain 


seq-f 


seq-t 


hmm-f 


hmm-t 


score 


E- 


-value 


PF00056 


1/1 


13 


29 . 


1 


18 [. 


3.9 




3.4 


CE00060 


1/2 


8 


64 . 


20 


77 . . 


86.8 


8 


. 9e-23 


PF00071 


1/2 


13 


64 . 


1 


52 [. 


111.9 


4 


,8e-32 


PF00634 


1/1 


57 


79 . 


13 


35 .] 


9.9 




0.39 


CE00060 


2/2 


65 


140 . 


110 


188 . . 


81.2 


2 


9e-21 


PF00071 


2/2 


65 


173 . 


] 85 


198 .] 


142.4 


4 


5e-41 
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1 TTTTGGGTGT GTGTGTGTGT GTGTGTGTGT GTGCCTTTAC TAGTGACTCA 
51 GGTCACAGTT TTCTGAGATT TTTTTTCTCC CCTCAAGACA GAATCTTGCT 
101 CTGTCGCCCA GGCTGGAGTG CAGTGGCCTC TCGGCCCACT GTAGCCTCCG 
151 CCTCCCGGGT TCAAGCAATT TTCCTGCCTC AGCCTCCCGA GTAGCTGGGA 
201 TTACAGGCAC GCGCCACCAT GCCTGGCTAA TTTTTGTATT TTTAGTAGAG 
251 ACAGTGTTTC ACCATGTTGG CCAGGCTGGT CTTGAATTCC TGACCTCGTG 
301 ATCTGTCCGT TTTGGCCTCT CAAATTCCTG AGATTACAGG CAT GAG CC AC 
351 CGAGCCTGGC CAGTTTTCTG AGTTTTTATT TGAAATCAAA ATAAGCTTTT 
4 01 TTTTTTTTTT TAATGGGCTT TAGAGTCCAG GGTAACGAAC ACTTTTTGGT 
451 GCCTATTACT GAACCATTCA GGGTATTCCT GGGGTGGTGA CCGTGTTCAT 
501 TTCAGAAACC AACATGTTCA TTTCAGAAAC CAAACTCGGG TAACTTTTGA 
551 TAAGTTCATC AACTAAGGCC CATGGCAGAA TTTGAGGGCT AAGGGGTGTA 
601 ATTAGTGTAT GGGTAGAAAT AAGTGCCTTC TTTCTATATT TTGGCGTTGT 
651 AGGAATTTAA AGTGATTCTG CAGTAAGTCT CAGGAGACAA TTTTCTTAGT 
701 TCTTAGAAGT TGGAAGATAA ACTTTGGACA ATGTATTACA CTATGCCCTT 
751 TGTAATTAAA TAACTCAAGA TAATGTGTTA AAGTTTAGCG GAGATTTAAA 
801 TTCCTGAGCT GATTAAAGAG AGCTGTTAAG GCCATAGGTT TTTTAAAAAT 
851 GAGTTAATAT TACTCCCAGA AATTGTAGGC ACTATATAGT GATGAATTGC 
901 ATATTTTTAT TGCTTATTAT TTTCCAGTCT TGCAGAATGG CTCAGGGTTA 
951 GTAGCAACTA AAAGATAATA CATTACAATT CAACCTGAAG GCCGGGACGA 
1001 AGGTAGGAAT TGGATTTTAG GCTGGCTCTG GGCTGTGTCC CTCCCATCCA 
1051 TGGGATGTGG AGCCATTGAA GGTTGTGGGG TCACGATGCA GGTGCTGTCT 
1101 CAGAAAGATA CATCCGACTG TGTGTGCAAA TGGGCTGGGG CGGAGAAGAG 
1151 AGAGAGAGGT AGAGTCCATT TGGAGACTAC TGCAATAGCC AGGCTGACGA 
1201 GTTAAGAGCG GGGCACAGTA AGAATGGGAA GAAATCTAAG AAGAAAATGG 
1251 TAGTGCGCGG GGCCAACAAT GGACGATGAC CGAACCCAGG TGGGGATGGG 
1301 TGAGTGACGA GAAGAACCGC TCCGTGCCGT CCAGGGAGCC CCTTGACTTC 
1351 CCTTCTGTTC TTAGAGCGGA CGTCCTCCTA CCAGCCCCCA ACCAGCGCCA 
14 01 CCAGGGTGGC GCAAGCCTCA AGCTGGTCAG GTCAGCAACA GCCGCAACGG 
14 51 AGGCAGGAGC CGACACGCTC GTACCCCGGC CCCCTCCCCG CCCCCGCACC 
1501 CCCGGCAGTC CCTCCGGTTT GACCACTCCC CCCGGTCCCT TGCCTCCCCC 
1551 GACCCCCAGC CTCCGTCGGC CGCCGGCACC ACCCTCCGCC CCTCTCCGCC 
1601 CCCTCCCCCG TGGGGCGCTG ACTCGCCCGG CTGCCACGTC TCACTGATGA 
1651 CATCACTAGG GCAGCTCGGC CTTAGCCAAT CCGCCAGGGG GAGTCCGAGC 
1701 GAAGTCCTAG CCAGCGAGTC AGAGGGGAGG GGAGCAGGGA GGGGCCGAGG 
1751 GTGGGGAGGT GAGGGAGTGG GGAATGGGGC GGGCGACAAC CCTTCAGGTA 
1801 CGCATGCCCC AGAGGCGCGG CGCTTGGCGG GAAGCTGAGT CCTGGCCTTG 
1851 CGTCGCACTG TCTGTCCTCA GCTCGCGTAG CCGCGCTCGC GACTCCCTTT 
1901 CCCGGCATGC CAGGCGGTGC GGCCGCCCTC TGGGCCGTGT AAAGGCCCCT 
1951 CGGTCTAAGG CTTCCCTATT TCCTGGTTCG CCGGCGGCCA TTTTGGGTGG 
2001 AAGCGATAGC TGAGTGGCGG CGGCTGCTGA TTGTGTTCTA GGGGACGGAG 
2051 TAGGGGAAGA CGTTTGCTCT CCCGGAACAG CCTATCTCAT TCCTTTCTTT 
2101 CGATTACCCG TGGCGCGGAG AGTCAGGGCG GCGGCTGCGG CAGCAAGGGC 
2151 GGCGGTGGCG GCGGCGGCAG CTGCAGTGAC ATGTCCAGCA TGAATCCCGA 
2201 ATAGTGAGTT CAGGAGAGCA CCGGTCGGCT GGGTCCGTGG GCCAGCTTGG 
2251 GGGATCTTAA AGGGGTCGAG GAGGGTTGGG GCAGAAGTCG GGGCATCGGC 
2301 TGGGGTGAGG CGAGGGTGAT GGGTCAGGAG AGGCTGGCGG CCGGGAGTCG 
2351 GGCCCCATTG TCTGACGCGG AGGGGCGGCC GCGCGGGGGA GGGGTCGGGC 
24 01 CGGAGGGGTG AGCCGCCCGG GCCTGGACCG GGTCAGGTTA GAGGGCCTGA 
24 51 CTGCGGGGCG GGTGCTGAGG AAGCCTGCCG AGGGGCCTGG GGCGGTGTGA 
2501 AGGGGTATCT TCTCTCGGAG GCAGTGACTT TTGAAGGAGG ACTTGTCTCT 
2551 AAGGGGAGGG GATGGGGTGG GAGAGCCCTT CTAGAGGGCA CTGTCAGACC 
2601 CTGCGCCCGC ACTCTGCGGA GCTGTCAGGA TCTTCGGGGT AGAAACCAGC 
2 651 TTTACTTGTA AATCCTGAGC TTGTTGGGTC TCTCTCCTTC CATCCTCCCC 
2701 GCCAGGTTTC AGGTAATATG GATGCTTTTC GGGACTGCGT GGGATTGAGG 
2751 GGAATGAGTA GATGGTGAGA AGCAACTGAA CATTTATTAG TTCTCTTTTT 
2801 GAGTTGTGTC TTGGAGGAGT TGTTTAAGAG CTCGCCGGGT CCATTGCCCT 
2851 CCTATAAAAA CCTGGGCATT TGTGAGAATT TTGTTTTTTT TTTTTTTAAA 
2901 GAGGACACCT AAGTCATTTT GTCTTCTGTG GGTCAAGGGA AAAAAAAAAA 

2 951 ACTAAAGCCA AGAAATGTCT TTTTGATACT CGCAGATTAA AGGAAGCTTG 

3 001 CTGTCAAGTT GAAAGAGAAA CGAACGGGAC CTATGATAGA TCTGTATGTA 
3051 GGTTTTGGAT TACCTGCTTG GATGCTTGCA GATAGGGAAT GAGGTTCCAT 
3101 GACGTGTCAT GAAAAGTTAA TGCATTTCTT TTTCTTGCTT ACTCAAGAAG 



FIGURE 3, page 1 of 21 



3151 TCACCACAGC AGATGTGACA CACCTGGCAC CTTTCCTGGG AACTGGTGTT 
3201 CACTTCCCTT GGGTAGAGTT TGTTGGGCTC TCCTCAATGG CCCTTTAAAA 
3251 ATTTCCTCTA CAGTTTACAT GCATGTAAAG TAATGAATAA TTGGAAGAGA 
3301 CCGAATTGGT ATTCCTTTTC AGTGTCAAAG GCCTTTGAGG GATGGGGGAA 
3351 AATCAGTATT TGTTGTAAAA GTTGAGTTTA TTTGCTGGTT TGGTCAATTA 
3401 CTGCTAGACA TTTTCCCCTA AAAGGTCCAC CCACCAGTTT AGCTGACTGT 
3451 CATATGTGTG TCACATGGCT CTTGCAAAAT GCTTACAAGT TTTGTAATAG 
3501 TGTGGCTTGA AGCTGAAATC TTTTGCACTA AACAGAAACC GTAGTATTTT 
3551 ATTAGAATTT CATGCTTTAG AAGTTGAGGG TAGTGTTCTT GTAGTGACAT 
3 601 TTGCTGTGTT GACAGTTTAA AAAAATTTTT TTTTCAAGGG CTCCAAGGAC 

3 651 AAAGTTGGTT TTGCACAGTT GAACGGAGGT GAACTTGAGG TTCTTAATTT 
37 01 AGTAGTTTTC TTGGTAACAA TAAAGAACAT GGATTTACTG CTTTATCGAG 
3751 GTTTATAGAC CTCTACTGTT CAGGAAATTT TCTGAATTTG C TAT AT AT AT 
3801 GTTTATTAGT GTAAATAAAT CTTCAAGATT AGTTGAGAAC TTTGACAAGT 
3851 TACTCAGCCT CTGAATTTTT TTTCCCTTTT GTAAAATAGG ATAATTGGAG 
3901 TCATTATTCC TGTCAGGGTA GTGGTGAAAT TCAAATGTAT ATAAAAGAAT 
3951 TTGAAAAACT GTGTGAGCAT TCTTCAGGTG GTATGCATCA TTTTCATGAA 
4001 AGGCATTCTA TTAGTACCAG GATTTAGGAA TATAATCCTT GCGCTTAAGA 

4 051 AGTTTAGATA TAGGCCAGGC GCGGTGGCTC ACCTCAGTAA TCCCAGCACT 
4101 TTGGGAGGCC GAGGCGGGCG GATCCCGAGG TCAGGAGATC GAGACCATCC 
4151 TCGGTAACAC GGTGAAACCC CGTCTCTACT AAAAATGCAA AAAAATTAGC 
4201 CGGGCGTGGT GGTGGGCACC TGTAGTCCCA GCTACTCGAG AGGCTGAGGC 
4251 AGGAGAATGG CGTGATCCCG GGAGGTGGAG CTTGCAGTGA ACCAAGATCT 
4301 GGCCACTGCA CTCCAGCCTG GACGACAGAG CAAGACTCCG TCTCAAAAAA 
4351 AAAATTATTT ATTGTTTTGA GACGGAGTTT CAATCTTGTT GCCCAGGCTG 
44 01 GAGTGCAATG GCGCAAATCT CCTCTCACCG CCACCTCCGC CTCCTGGGTT 
4 4 51 CAAGTGATTC TCCTGCCTCA GATTCCCGAG AAGTTGGGAT TACAGGCATG 
4501 TGCCACCACT CCCGGCTAAT TTTGTATTTT TGGTAGAGAC GGGGTTTCTC 
4551 CATGTTGGTC AGGCTGGTCT CAAACTCCCG AAGTGATCCG CCCGCCTCAG 
4 601 CTTCCCAAAG TGTTGGGATT ACAGGCGTGA GCCACCGCGC CCGGCAGAAA 
4 651 TAGATTTTAT ACATGTCAAA TACCAGTAGA TATAGCAAAT TCCAGATGTG 
4701 TGGCATGGAT GAGAGCAACA AGATTTCAGG GGGATGGTGG GTTGTGGTTG 
4751 GCTATCTGGG TTTTGGAAGA CTTTATAGAA GAGAGACCTG AAAGGGATTT 
4801 ATCAGCAATT AGATTTGGAG GAACAGAGGG AGTGACTAGG AATTTTCAAG 
4 851 GGGGAGAAGA AGGAGGAATG GCTCATAAAT GACAAGGACA GTAATAAGTA 
4 901 AATACGGTGT CAAATCATCC TTTCTTTTGA AGACTAATGA CCTCAAAGGG 
4 951 ATCAAACCCA GAAACAGTTT TTATATTTTT TCTGGGATCA AATACATGGG 
5001 TATCTGGCCT ACTATATTTG TATTCTAGAC TGTTTAGTAA AATAATACAG 
5051 GAATTTGAGA AAACCTTTGC AAAAGTGTTA GTGAAAATTA CTTAGGGTGA 
5101 GAGGAAGTGA GGGATATTTT ATTAGGGGAG GTCACAAGGG CAGTGAGCAA 
5151 TCAGATTTTT AGTAATCTGA CTTAAGCAGT TTCTTTTTGT TTTAATGAAG 
5201 CTTGTTATCT TTATAAAAGT AATTAGAGAA AATTTGGAAA ATAAAGGAAA 
5251 GAAAGAAAAG TTCTTTAGTG TTTTATCACG CAAATACAAG CTCATTCGTT 
5301 TTTAACATCT TGTTCCAAAC TCCAAAGTCT TGCTTTCTCT TCAATTAAAA 
5351 CTTTAATGGG TGGATGCTTT TCCTGCTTCC AGTATGTTAT CTTAATAACT 
54 01 AACAATGGTA TATTAGCTAA TGTTTACAAA TGTACTCCAG ATGTTCCTTA 
5451 AGTTACTTTG GTTTATCATT ACCAATTTAT ATTGTTTCTT TTAGAAATTT 
5501 ATAATCTTTG TTAATGGGTT CTGCTAAATT TGGTAGTGAA AATGGGATCT 
5551 TGAGAAAAAA GATTCTGAAG CAACAGAATT TTTAGATTTA TATTGGTTTA 
5601 CATAAGAGTT GGTAGCTGTA TTACTTTTTT TGTTTGTTTT GTTTTTTTTT 
5651 TGAGACGGAA TCTTGCTCTG TCGCCCAGGC CTTGGCCTCC CAAAGTGTTG 
5701 GGATTACAGG CGTGAGCCAC TGTGCCTGGC TGTTTGTGTT TTTTTTTGTT 
5751 TTTGTTTTCT TTTCTTTTTC TTTTTTTCGA GATGGAGTCT CACTCTGTCA 
5801 CCCAGGCTGG AGTGCAGTGG CGCGATCTTG GCTCACTGCA ATCTCTGCCT 
5851 CCTGGGTTCA AGCGATTTTC CTGCCTTGGT CTCCTGAGTA GCTGGGATTA 
5901 CAGGCATTTG CCACCATAAC CAGCTAATTT TTGTATAGAG TACCCAGCCA 
5951 TCTCTAATGT TGATCAGGCT GAAGCAGGTG GATCACCTAA GGTCAGGAGT 
6001 TCAAGACCAG CCTGGCCAAT ATGGCAAAAC CCTATCTCTA CTAATACAGA 
6051 AAATTATCTG GGTGTGTTGG CTGGCGCCTG TAATCCCAGC TACTCGGGAG 
6101 GCTGAGGCAG GACAATCTCT TGAACCTCGG AGGTGGAGGT TGCAGTGAGC 
6151 CGAGATCACA CCATTGCACT CCAGCCTGGG CAACAGAGCA AGACTTGTCT 
6201 CAAAAAAAAA AAAAAAAAAA AAAAAAAGGC AATTGAAAGT GTAATCTGAA 
6251 CAGTTAAAAA AGTAGATAGA AAGGGTTAAA GCTTTTTTTT GAGGATCTGA 
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6301 AGAAAAATGT GGATTTTTTT TGAGCTACGT TTTGAAGCAG GCAGTGATTA 
6351 TTTCAGCACA TTAAGAAATG CTTAACATGG CCAGGCGCAG TGGCTCACGC 
6401 CTGTAATTCT CAGCACTTTG GGAGGCCGAG GTGGGCGGAT CATTTGAGGT 
6451 CATGACCAGC CTGGCCAACA TGATGAGACA CTGCCTCTAC TAAAAATACA 
6501 AAAATTAGCT GGGTGTGGTG GTGCACGCCT GTAATTCCAG CTACTCAGGA 
6551 ACCTGAGGCA GGAGAGTCAC TTGAACCTGG GAGGCGGAGG CTGCAGTGAG 
6601 TCCAGATCAT GCCACTGCAC TCCAGCCTGA GGGACAGAGT GAGACTCCTC 
6651 AAAAAAAAAA AAAAAAAAAG AAAGAAATAC TTAACATTAT TCTCGTGATT 
6701 ATTCTCATAA CATTTTTCAT AATCCACTGG CTTCCAGTGG ATTTTTTTAG 
6751 TGTCAAGAAA ATAATTTTGA TTGGTTCATC TTTAAGGAAT GTGTTAAGAA 
6801 TAAAGCATGT CTACCTGTCT TCAGTATACC AGCTAACTAT AGTAGGAAGA 
6851 AATATAGTAG TCTACTTAGA TCAACTATAA TTCTTTAATG CAGAAAAAGT 
6901 TTAAAGTATT TACCTTATTT TTAGCCCCCA TCCCCTTAAG TATATCATGG 
6951 CTCCAGAATC TCTGAAAATG TTATCAGTCT TTCAGACTTT GCTCTTCTTT 
7001 CAT GT TAT AC TCAAGAAACA TTTGACCTTT TTTTTTTTTT TTTTGCTTGC 
7051 ATTGTGTTTC AAATAATTTT TAACAAAACT TAAGTGTTTG AAAGTGAAAG 
7101 CAGGTTGTCT TTGTGACTTT TGGTGGTGGT TTGAAAAACT CAGAAAAGTT 
7151 TAAAGAAGAA AGATAACTAG TATTCTCATT GTCCAGAATA TGATTTTTTA 
7201 AATGTCTATA GAATATCACC ATCTGTAATT CTTCCGGTAA TTTAAGTATT 
7251 CAGTAGTTGT ATAAAACCTT TAAAATATAT ATATTGAGAA TTTTGTGTGA 
7301 ATGAGATGAT GAGATAATCT TGTAGGATCA TTTAAAGATA AGAACTGAGG 
7351 CCTGGCACAG TGGCTCATGC CTATAATCAC AGCACTTTGG GAGGCCCAGG 
7 401 CGGTAGATCA CCTGAGGTCA GGAGTTTGAG ACCAGCCTGG CCAACATGGC 
7 4 51 AAAACCCTGT CTCTACTAAG CATAGAAAAA TTAATTGGGT GTGGTCGTGC 
7501 CTGCGTGTAG TCCCAGCTGC TTGGGAAGCT GAGGCGGGAG AATCTCTTGA 
7551 ACCCTGGAGG TGGGCATTGC AGTGAGCTGA GATTGCGCCA CTGCACTCCA 
7 601 GCCTGGGCGA CAGAGCAAGA CTCTGTCTCA AAATAAAGTA AAATAAAATG 
7 651 AAGATAACAA CTGAAATTTC ACATTAAAAA TTTTTTTGTA GCGACTGTGC 
77 01 CTCCTATGTT GTGCAGGCTG GTCTCAAACT CCTGGCCTCA AGCGATCCTT 
77 51 CCAAAGCACT GGGTGGGCCA CCATGTCCAG CCTGAAATTT TGCATTAAAA 
7801 AATTTCCCGC TTTTGGCTGG GCGAGGTGTC TCACGCCTGT AATAGCAGTT 
7851 TGGGAGGCCG AGGCAGGCAG ATCACTTGAG GTCAGTTCTA GACCGGCCTG 
7901 GCCAATGTGG TGAAACCCTG CCTCTACTAA AAACACCAAA TTAGCTAGGC 

7 951 GTGGTGGTGT GCGCTTGTAG TCCCAAGCTA CTGAGGAGGC TGAGACAAGA 
8001 GAATCGCTTG AATCTGGGAA AAAGAGGTTG CCGTGAGCCA AGATTGGCCA 
8051 CTGCACTCCA GCCTGGGTGA CAGAGTGAGA TTCTGTCTCA AAAAAATAAA 
8101 AAATAAAAAT TTCCCCCTTT AATCAAATTA AGTTAAAATG AGGGATGTTA 
8151 GACAGTTTTT AACCATCAAA TATTTTAGTT TAGTTTTTTT TTTTTAACGT 
8201 TGTCTTAAAG ATGGAAGTGC TTCAAAATCA AATCTTCCTT GCCAGTTCTC 
8251 TACTTGGCTT CTTTTTTTTT CTTTTTGAGA TAGAGTCTCA CTTTGTCACT 
8301 GGAGTGCGTT GGCGTGATCT CGGCTCACTG CAACCTCCGC CTTCCAGGTT 
8351 TAAGTGATTC TTCCACCTCA GCCTCTCAAG TAGCTGGGAG TACAGGTGTG 
8401 TGCCACCACA CCCGGCTAAT TTTTGTAGTT TTAGTAGAGA CAGGGTTTCA 
8451 CTATGTTGGC CAGGCTGGCC TCAAACTCCT GACCTCGTGA TCCACCCACC 
8501 TCAGCCAAAT TGCTGGGATT ACTTGTGTGA GCCACGCGCC TGGCTTCTAC 
8551 TTGGCTTTTA AAGGGAATTT TGCTTTCTGA GTAATTTTAT TTCTCAGGTA 

8 601 TCTTGGTCTT TTTAATTCTG GAAGCAATCT TAATAATTTA TGTATGTGCC 
8651 CTGTAATCCC AGCACTTTGG GAGGCCGAGG TGGGCGAATC ACGAGGTCAG 
8701 GAGATCGAGA CCATCCTGGC TAACACGGTG AAACCCCATC TACTAAAAAT 
8751 ACAAAAAATT AGCTGGGCGT GGTGGCAGGC GCCTGTAGTC CCAGCTACTT 
8801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
8 951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
9401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
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94 51 NNNNNNNNNN NNNNNNNNNN 
9501 NNNNNNNNNN NNNNNNNNNN 
9551 NNNNNNNNNN NNNNNNNNNN 
9 601 NNNNNNNNNN NNNNNNNNNN 
9651 NNNNNNNNNN NNNNNNNNNN 
9701 NNNNNNNNNN NNNNNNNNNN 
9751 NNNNNNNNNN NNNNNNNNNN 
9801 NNNNNNNNNN NNNNNNNNNN 
9851 NNNNNNNNNN NNCCAGGCTG 
9901 AACCTCTGTC TCCCGGGTTC 
9951 AACTGGGACT ACAGGCGTCC 
10001 TAGAGATGGG GTTTCACCAT 
10051 TCAGGTGATC CGCCTGCCTT 
10101 GAGCCACTAC GTTTGGCTGC 
10151 ACATTTTGGA ATTTTCCTTT 
10201 CCGGCCCCCG CTTTAGAGGG 
10251 TCCCTATAGA AGGTGGGGCT 
10301 TTTAGGGAAG AAAATTTAGG 
10351 GTGTTGCAGA TTATTAATAG 
10401 ATGGTCCTGA GGAGCCAGAG 
10451 ACATGTGAGA GATAGGGTAC 
10501 CTTCTGTGAC AGGTTGGCTC 
10551 TAAATCCAAG GTTGTGATAG 
10601 TGTAGATTTC CTAATTAGGA 
10651 CACAAGAGGG ACCATTCTTT 
10701 GCTATTTAGC ACTTACTCGG 
107 51 TTTTTTTTTT CGAGACATGA 
10801 GTAGCACAGT CATGGCTCAC 
10851 CCTCCTGTCT CAGCCTCCTG 
10901 TTTAAGAGAT GAGATGTCGC 
10951 GGCTCAAGTG ATCCTCCCCG 
11001 GTGTGACCCA CCTCACTTGG 
11051 TGCAGTCTGC ATCTCAAACT 
11101 TATGTAATTC ATGTTTTCCA 
11151 GTATTTCCTT TGTATAATAA 
11201 TTCTACTCTC CTTATAGATA 
11251 ATTTAACCCA TTTATGGTCG 
11301 TCTGCTCCCT TTGTGTCTAT 
11351 AGGCAGTTCA CACCAAGGCT 
114 01 TTTAGGAGAT TGAAAACAGA 
114 51 TAAAATGTGA AAATCTGAGA 
11501 CCAAACTCAT TTGAGTTTTT 
11551 GTCGCTTTGT CCCCCAGGCT 
11601 CAACCTCCAC CTCCCAGGTT 
11651 TTGCTGGGAT TACAGGCTGC 
117 01 ATAGTAAAGA CGGAGTTTCA 
117 51 GATCTTGTGA TCCTCCCGCC 
11801 GTGAGCCACT GTTCCCGGCC 
11851 ATGTTAGTCT TAAAGTGGGT 
11901 CTAAATTTAT AAAAGGCCAT 
11951 GATAATTATG TAGTGCTTTA 
12001 TGAAGAGCAG TTGGATTTCT 
12051 CAAATTAGGC ATCTTCACCA 
12101 TTTTTTATTT TTATTTTTTA 
12151 TTTGGTAGAG ACAGGATCTC 
12201 TTGGCTCAAG CAGTCCTCCT 
12251 GCTGTGAGCC ACTGCGCCAG 
12301 TCTAATTTAA TTTAAGCAGT 
12351 TTCTGGGTGC TAGTTTATAT 
12401 TGTTTTTTTT TGTTTTTGTT 
12451 CTGTCGCCCA GGCTGGAGTG 
12501 CTGCCTCCCA CCCAGGCTGG 
12551 ACCTCTGCCT CCCAGGTTCA 



NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
GAGTGCAGTG GCACAATCTT GGCTTACTGC 
CAGCATTTCT TCTGCCTCAG CCTCCTGAGT 
ACCACCACGG CCAGCTAATT TTTATATTAG 
GTTGGCCAGG CTGGTCTCCA ACTCCTGACC 
GGTCTCCCAA AGTGCTAGGA TTACAGGCGT 
TTATCAGCTT TTTACCACTT TGTCGCCACT 
GAGAATTAGG CAAAATGCCC AGACTCCCCC 
AGAGGGGAGC AATTAGACTA TTCCTTTGTT 
GAGATTACTG CTTTGATATC TGGAATGTAA 
TCTTGGCCTT TCTTTGGAAC CACCCTGGGA 
GGTAATGGTG GAATGATATT CAGGGGAAAA 
AACTAAGTGT TAGTTTGTTG GCTGACTGAA 
AGAAGAAGTA GGAAATAGTT TTCCTTGGTA 
AATTGGCTGG AACACCCTAC ACTGCTTTAT 
GTTCCAGTTA AGTTTACTGT GTTCTATGCT 
CAAGTAGTGT TAAATATGCA TGCCTTTATT 
TGGAAACATC ACTTTTTAAT AATACTAGGT 
TGCCAGCCAC GTGGCTATGG TTTTTTTTTT 
TCTAGCTCTG TCTCCCAGGC TGGAGTGGTG 
TGCAGTCTCA ACCTCCTGTA CTCTAGTGAT 
AGTAACTGGC ACCATGCCTG GCTAATTTTT 
TATGTTGCCT ATGCTGGTCT CGAACACCTG 
CCTGAGCCTC TCAAAGTGTT GGGATTACAG 
CCATCTATGG TCTTTACATA GGGCATTTTG 
AGTGATCTTC AACAGTGAAA CTCAGTGAAT 
AGAACAATGA TGGATTTAAT TTCTCTGAAT 
TAGTACTTAA GTGGAATTAC TCTTTGTCCT 
TTTTCTGGTA TCTTGATTTG GGACTGTTAC 
TGTAGCCATA CTCACGTTAC ATTTGATGCA 
AT ACT CAT AT AACATTTTGC ATAAAGTTAT 
GTTCATGAAC CTCAGATTAA GAATACTTGA 
AAAGAGAATG TTAACTATCA TTATCAATAT 
GTGACAAAGC TTAGCTTTAA ATCTGGTATC 
TTTTTTTTTT TTTTTTTTTT GAGACAAGGT 
GGAGTGTAGT GGTGTGATCT TGGCTCACTG 
CAAGTGATTC TCCTGCCTCA GCCTCTGAAG 
GCCACCACGC CCAGCTAATT TTTTGTATTT 
CCTTATTGGC CAGGCTGGTC TCAAACTCCT 
TCGGCCTCCC AAAGTGCTGG GATTACAGGT 
TAATTTGAGT TTTAAAATGT GGAGTTTAAG 
TAGATGAAAT TTATAAAAAT AGTCAAATAG 
TTGAAACAAT TTTGTGAAAT ATATAATGTG 
TGTGTAGATT GGTGGTTAGC ATCTGCCTGA 
TACTTACTAA AGCTAGTGAA ATCTGAACTC 
GGCTTTTTTG AGCCGAGCTA ACTTACTCTC 
ATTAATTAAT TTTTTTTTTT TTTTTTTTTT 
CCCATGTTAC CCAGGCTTGT CTCTGGCTCC 
ACCTTAGCCT CCCAAAGTGC TAGGATTACA 
GCTGAGCTTA TTCTCTACTA ACACAAGTGT 
GAATCACACT TTTCTTTGTA TTTGGTCAGG 
ATGATTTGAT TCATTCTGAT AGGGTTTTTT 
TTTTTGTTTT TTTTGAGACA GAGTCTAGCT 
TGGTGGCTCG ATTTCGGGTC ATTGCAACTT 
AGTGCAGTGG CTCGATTTCG GGTCATTGCA 
AGCGATTCTC CTGCCTCAGC CTCCTGAGTA 
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12 601 GCTGGGATTA CAAGCACCCA 
12651 TAGTAGAGAC TGGGTTTCAC 
12701 ACCTCAGGTG ATCTGCCTGC 
12751 TGTGAGCCAT CACACCAGGC 
12801 CTCACTCTGT CACCCAGGCT 
12851 CAGCCTGGAC TTCCCAGGCT 
12 901 TAATTAGGAA TATAGACACA 
12 951 TTCTTTTTTC TCTTTTTTTG 
13001 CTGGTTTTGA ACTCCTGAGC 
13051 CGTGCTGGGA TTACAGGCAT 
13101 AAAACTTTTT ATTAAAAAAC 
13151 TATACATAGG TACACACATG 
13201 AAGAAAATAG CCCTTTGGGC 
13251 GCACTTTGGG AGGCTGAGGT 
13301 ACCAGCCTGG GCAACACAGT 
13351 TCAGCTGGGT GTGGCAGCAT 
134 01 GAGGCAGGAG AACTGCTTGA 
13451 GATTGTGCTA CTGCACTTCA 
13501 AAAAAACAAA CAAACAAACA 
13551 GAGTATATTA AGAGTTGAGT 
13601 TTTATTTTAA ATTACAAAAC 
13651 CTCATGTTCA AACAAGCATA 
13701 TTGTAATATG TTTACGGAAT 
13751 GAAGGGGCCT ATAATTTCTG 
13801 ACTGGAGAGG AACAGAGCTC 
13851 TCTGAGTGGG GTCAGCAGCA 
13901 GTGTACATTA TTTCCCTTAA 
13951 CTAGTTTTAC AAACAAGGAA 
14 001 CCAAGGGCAT TCAAAGTAAG 
14051 TGGATGTCCC ACTACAACAA 
14101 TACTTCTAAG AATTTAACAT 
14151 AATCTCAGCA CTTTCGGAGG 
14201 GTTTGAGACC AACCTGGGCA 
14251 AAAAAAAAAA CTAGCTGGAC 
14301 TCAGGAGGCT GAAGTAGGAG 
14351 AGTGAGCCCA CATTGCATCA 
144 01 GCCTATCTCA CACACAAAAA 
14 451 GGCACTAACA TTTTGTGGTC 
14 501 TGATGTGGTG GCTCATGCCT 
14 551 GGAGGATTGC TTAAGGCCAA 
14 601 GATCCCCTTT CTGCAAAAAA 
14 651 ACCTGTAGTC CCAACCACTT 
14701 CCAGGAGTTT GAGACCT GGG 
14751 TAAAAATAAA AAATAGCCAG 
14801 CTCAAGAGGC TGAGGCAAGA 
14851 CCGTGAACTG TGATTGCACC 
14 901 ACCCTGTCTC TGTGGTGGTG 
14 951 AGGGTTTGTG AGGTATGTAT 
15001 TCACATCTTG GGTTGAAATT 
15051 GCTATATATT TAAACTTGGA 
15101 AAAGCAAATG AAGCAGAGGG 
15151 GCTGCAGAGC ATGTTACTGT 
15201 ACTTGTATGA TCATAGTACT 
15251 TAACTTTAGA TTATATATGT 
15301 CTGAACAGAA GCCTTGGATA 
15351 GAAATTTGTA TTGTTATCTC 
154 01 AATGGCCACT TACTGTGCTT 
154 51 TTTCAGAAAC GTGATTTGTA 
15501 TGGGTGCTAA TTGAACCTTA 
15551 AGTGAATAAC TTGCACCAAG 
15601 ATTCAAGTAC GGGTCTTCCA 
15651 AGGAGGTAGA GAACGTTGAC 
15701 ACTTGTATGC AT AT T AAAG A 



CCACCATGCC CGGCTAATTT TGTGTATTTT 
CATGTTGACC ACGCTGGTCT CGAACTCCTG 
CTTGGCCTCC CAAAGTGCTG GGATTACAGG 
CTCAAGAACT TTTTATTTTT GAGACAGGGT 
GGAGTACAGT GGTGAGATCA TGGCTTACTG 
CTGGTGATCC TCCCATCTCA GCCCCTGGAG 
CACCCATGCC TGGCAGTTTT TGTATTTTTT 
TAGAGACTGG GTTTCACATG TTGTATCAGG 
TCAAGCAATC CTCACTCTTT GACCTCCCAA 
GAGCCACTGT ACCTGGCCTT TTCTACATTA 
CCAAATCTTC CTTGTGGTTG TATATACATA 
GAGAATTTTA CCTTGGAGGA AGGCTTGGTA 
CGGGTGCGGG GGCTGACGCC TGTAGTCCTA 
GGGCGGATTG CCTGAGCTCA GGAGTTCAAG 
GAAACCCTGT CTCTACTAAA ATACAAAAAA 
GTGCCTGTAG TCCCAGCTAC TTGGGAGCCT 
ACCCGGGAGG CAGAGGTTGC AGTGAGCCGA 
GCCTGCGCGA CAGAGCAAAA CTCTGTCTCA 
AAAAAGGAAA ATAGCCTTTC TCTATCATCA 
TTTTTTTTCT GTTTTTTAAA ATTTTTGTTG 
ATGGACTCTG CTTACAAATT AAGAAAATGA 
ATCAATATAA CAGTTAATAC AAGTTAAATA 
AGCATGGCAA AATAGTGCAA AAGATTTGGG 
TTAACAGAAA GTTTTAGTTA TGTTGATTCA 
CCAGAAGGAC TCCAGAACAC TTGATGCTTG 
CTGAGTTCCC ACCAGCCAGA AAGTTTGTGT 
CTGCCACAAT AATCCCATGA AGAAAATGCC 
ACAGAGGCAG AGAAGAGTTA AATGACTTGC 
CAACTGAATT GGAATTTTAA CTCAAAGGCT 
ATAGGCTGTT TCTGCTTTAC TACATGTGCT 
TTTAGGCTGG TTGTGGTGGC TCACTCCTGT 
CTGAGGTGGG TAAATCACTT GAGCTCAGGA 
ACATGGTAAA ACCTCATCTC TACCAAAAAA 
GTGGTGGCAC GCGCCTGTGG TCCCAGCTAC 
GATCGTTTGA GCCTGGGAGG TGGAGGTTGC 
CTGCACTCTA GCCTAGGTGA CAGAGTGAGA 
AAAGAATTTA AAATTTTAGT CAAGTAATTA 
AGTTACTTTA CGAATTCATG GTTGGAGGCC 
GTAATCCCAG CACTTTGGGA GGCTGAGGCA 
GAGTTCAAAT CAGCCTGAGC AACCTAGTAA 
TTTAAAAATT AGCTGGGCAT GGTAGTGTGC 
GGGAGGCTGA GGTGGGAGGA TTGCCTGAGG 
CAGCATATGA AGACCCTGTC TCTAAAAAAC 
GTGTGGTTGG TGTGCTTGTG GTCCCAGCTA 
GGGTTGCTTG AGCCCAGAAG TTGGAGGCTG 
ACTGCACTTC AGCCTGGGTG ACATAGCAAG 
GTGGGTGGGG GTGGGGGAAG GGATTTAAGA 
TATTTATAAA TGGGCTTTTA ACTTTACCCT 
AATTGTATCC ATTCTCAGTT TTTCTGTCTT 
GACTTAGAGG TCATGGATGT CTTTCTATGA 
CTGCCTTCTC TTGCTGTAGA GGGCACACTT 
TTTATGCATT GCTAGGCTTT GGGAGTTGTG 
TACAACTATT AGTTGGCAAT TTTTAAACTT 
AAACTCCTGT GTTCCTTTGT CACTGATAAT 
AATAATTTTG AAGTTTTTGT CTGAACCTCT 
ATGGTTTTGC TGGGAGGAAG GAGAAATAAC 
CTGTATGTGC CAGACAGTAT GTGCTAGATG 
ATCCTGACAA GAAGCCTAAT TGGGTGGTAG 
TAGATGAGGA AATTGAGGCT CATGGTGGTA 
ATCCTATGGC TGGTATGCAG TAGAGCCTCA 
GGTCCAAACC CATGCAGGCT TTGAGAGGTA 
ACCCCCTTCT TGGTGTGTTT TTCAGCAAAT 
CTGTCTACCC TTTTGTCATC TTGTGTCACT 
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15751 TGCTGCTTCC TTTGGTACTA CCCAAATTTC TTTCAGCATT TCAGCTTTGA 
15801 ATTTTTATTT TTATTTTATT TAATTTATTT ATTTTTTTGA GATGGAGTCT 
15851 CACTCTGTTG TCCAGGCTGG AGTGCAGTGG CGTGATATCA GCTCACTGCA 
15901 ACCTCTGCCT CACAGGTTCA AGCAATTCTT CCTGCCTCAG CCTCCTTAGT 
15951 AGCTGGGACT GGAGGTGCCC ACCACCACGC CCAACTAATT TTTGTATTTT 
16001 TAG TAG AG AT AGGGTTTTAC CTTGTTGGCC AGGCTGGTTT TGAACTCTTG 
16051 GCCTCAAGTG ATCCACCCAC CTCGGCCTCC CAAAATGCTG GGATTACAGG 
16101 CATGAGCCAC TGCACCTGGC CAGCTTTGAA TTTTTAGAAT ACTGTTCTAA 
16151 ACAGAACTAT ATTGGAACCT GGAAAATTAA TCTATTGTCT CTAAATACCA 
16201 AAGAAAAACA TGTAATTTTA GTGGTTGATT ATGGGAACAA TTTTTTTTAA 
16251 GATGGTTCAT CTGAATGGGA AGCATTTTTT TTTTAATTGC TTGACTATTT 
16301 CTTTAAATTT GGAGAAAAGA CCATTGCCCT CTCAGATTTC TGGTAATTGG 
16351 TCACATTGAT CAT T TAT ATT GACTGACAGG CTGCTTTGTC CACAGCTGAA 
16401 GGATTGTTTA ATTTTTTTTA AATTATAAGA GTAATATGTG CTCACTGTAA 
16451 AATTCACAGT ACAGAAGCAT ATGAACTAAC TAAAAGTTCT TACCTCTTGT 
16501 CTCCAGCAAG GAGTAAGTGT TTCAACCTGA AGGTTGGTTT TGAATTGTGT 
16551 TCTGTGGAGC GTACTTAAAG TGAGTGAAGA AGAAAAATTT ATGTCAATCA 
16601 TGATCATTGC AGCTGAAGTT TTTATTGTTT CACCCCCTAA AGGTTATTAA 
16651 AATAGTATGT AGTTTAGTAG TCTTGATAAT TTTCCCTTAA GATTTATTGG 
16701 CCAGTATATC AGGATTTTGT TTTAAATTTG ATATGTGAGC TTAGTTTTAT 
16751 GCTATTTTCA AATAAGACAT TTAGAAGAAG ATAAAATAAC ATTCCTGTCT 
16801 TAGTCTGTTT TCTGCTGCTA TAACAGAATA GCACAGACTG GGTAATTTAT 
16851 AAAC AG T AG A AGTTTATTTG GCCTGTGGTT CTGGAGGCTG GGAACTTCAA 
16901 GAGCATGGTT CTGCCCTTTG TGCTGTGTTA TCATATGGTG GAAGGTGGAA 
16951 AGGCAAGTGG GTATGTCAAG AC AG AG AG C A AGAAGGGGCT TGAACTCACT 
17001 TTTATAACAG AGTGACTCCA GAGATAGCTA ACCCACTTTT GAGAGAATGC 
17051 ATTAATCCAT TCATGAGGGC AGAGCCCTTG TGACCTAATC ACCTCTCATT 
17101 AGGCTCTGCA TCCTTAAACT GGTTTTTTTT TGTTTTTTTT TTTTGAGACG 
17151 GAGTCTCGCT CTGTTGCCCA GGCCGGACTG CGGACTGCAG TGGCGCAATC 
17201 TCGGCTCACT GCAAGCTCCG CCTCCCGGGT TCACGCCATT CTCCTGCCTC 
17251 AGCCTCCCGA GTAGCTGGGA CTACAGGCGC CCGCCACCGT GCCCGGCTAA 
17301 TTTTTTGTAT TTTTTTAGTA GAGACGGGGT TTCACCTTGT TAGCCAGGAT 
17351 GGTCTCGATC TCCTGACCTC ATGATCCACC CGCCTCGGCC TCCCAAAGTG 
17401 CTGGGATTAC AGGCGTGAGC CACCGCGCCC GGCCCCCCTT AAACTGTTGT 
17451 ATTGGGGATT AAGTATCTAA CACAGGAACT TTGGAGGATA CATTTAAACC 
17501 ATAAGAATTC CTGTCATGCA AATGAATCCA TTCTAGATGA AAGAGAATGA 
17551 ATTTAGTTTC CATTGAACTT TATAAATAGG CCTTTTCTAA GGTACTTACA 
17601 GCTGATATTA TAAAATTTAT ATTTGTTTTT ATAAATTTGT ATTTGTATTT 
17 651 CTGTTTGTAC AAATACAATT ATACACTATA GTTCTCTGCT GTTAGATTTT 
17701 TTTTCTTCCT TAGCATGTTT CCAAAGGGTG GAATGTTGAA AGTTGGGTTA 
17751 ATGTCAATCA GCTTTCTTTT GTAAAGTGTT CATTGACATG TGAACCTTGT 
17801 CTGAGAATCT AAATTTTATT TCATGAAAGA AGAAAACAGT ATATTCTCAT 
17851 TTAACCCAGA ATTTAACTTC ATATACTTGT GGCTGTATTG GGAGTATGCC 
17 901 ATTGCTGTCT GTTTACAACC TGACCTACTC TACCTACTTA GAAGTAATTT 
17 951 GTGTTATGAT AGGTGTGCTG TGCTGACATA TGCTGAACAT ATTTGTAAGG 
18001 GTGTTAAGTC ATTGAATAAA ACGCTTTTCT CCTCCTTTCA AATAACATTT 
18051 TTTATTTCTG GTTATAAAAG TCATACAAGC TTACTGCAGG TTGTTAAAAA 
18101 GGTATAAAGA AGAAACCGTC AATCCATTAT AATCCTACAG TTTAGACTTC 
18151 CTGCTCCAGC CTCTCAGAGT GCTGAGATGA GCTAGCCATG CCCAGCCCCT 
18201 CAAAAGATTT TTTAAAAAAC AAAAATGAGG TTATACTTTA AAAAATTCTA 
18251 TATTCCTTTC ACATAACAGT GTTATTTTGG AGGTTTTAGA ATTTCCAGTA 
18301 GCATTTTAGA T T C AG AAAC A AGCTGATTCA TCCTCTACTT TGTACTTTAG 
18351 GCAAGAAAAG AATTTTACCT AAATAGAATT TTGAACTGAA AATCTGTTTT 
184 01 TCTAACTTTT TATTTAAAGA ATATTGTTCC ATGCTTTCAC AGTAGTGACT 
184 51 TTTAATTTTT ATATTTTTTA TTTTATTTAT TTAGAGATGG GGGTCTCACT 
18501 CTTGTTGCCT AGGCTAGAGT GAGTGCAATG GTTCTATTCC TAGCTCACTG 
18551 CAACCTTGAA CTCCTGGGCT CAAGTTACCC TCCTGCCTCA GCCTTCTAAG 
18601 TAGCTGGGAC TACAGGTGTG CACCACTGCA CCAGGCTTTT TTTAAAGGCA 
18651 TAGAAAATGG TAGTGCTTGC ATACAAAAAT GGCGTAGGTA CAT AC AT C AG 
18701 CGGACATCAA GACTATGTTC AGATCATAAA TGTACATATA TGTACCGATG 
18751 CCATTTTTGC ACGCAAACAA ATAATGGAAA TTGAACTCTA AACTGAAATT 
18801 TGAAACAAGG GTTCTGGGGT GGGCCCTCTT GCTGATTTGT AATTGAATGT 
18851 ATAGTTCAAT TTTTCCCCAT CTGTTAAGCA AAAGACAATT CTAATGTTAG 
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18 901 CAAAAATCCA CATATCCTGT CAT T GAT CAT TTTTTCCTTA ATTTTCTTTA 
18951 AGAGATGGGG CTTCTCTCTA TGTTGCCCAG GCTGGTCTGG AACTCTTGGG 
19001 CTCAAATGAT CCTCCAGCCT CAGCCTCCCA AAGTGCTGGA ATTAATAGGC 
19051 ACAAGCTGCT GTGCCTGGCC CTGTCATCAG TCATTTAACT TCATGCAAAC 
19101 TGAGTAGAAT AAAACTCGTC CTTACTGTAC CTTATTGCTT TTGTTTTATT 
19151 GTTGGAACCT CCAATATTGC GAAAGTAGAC CAAAAGTTGA CTTATAGGAA 
19201 AAACTGATAG CAAAAATAAT TTTTCTCTTG TTGCTGTATT TCATGCCCAC 
19251 CATCCAGTTG TTAAAGCCTA CTGTTAATTT CTCTCAGCCT CCTCCTTTCT 
19301 GTCCAGGCTT ATTCTATGCC ATTCTTACCT TAACTGTTTT TAGCTTTCTC 
19351 ATAGAGTGAA CTTTTTAAAT TAAAATAAAA TATCTGCTCG TAGTATTATA 
19401 AAATTCAAGC AGTTCAACAG AATTTTTCAC TAATAGAAAT ACTTGTACCT 
194 51 CAAAAGCAGC TTTATTTTAC AAACCCAGCC CAATTTGTGA TTAGATTTAA 
19501 CTTGAGAAAA CATGAAATGT CTCTCATATT GTTTAAAAAT ATCATAAGTG 
19551 GCTGGGCACG GTGGCTTATG CCTATAATCC CAACACTTTG GGAGGCTGAG 

19 601 GCAGGTGGAT CACTTGAGGT CAGGAGTTTG AGACCAGCCA GGNNNNNNNN 
19651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
19951 NNNNNNNNNN NNNNNNNNNN NNNNNNNTTC ACCATGTTGG CCAGGCTGGT 
20001 CTCAAACTCC TGACCTCAGG TGATCCACCT GCCTGGGCCT CCCAAAGTGC 
20051 TGGGATTATA GGCTTGAGCC TCGCCTGGCC TCCTCATAAT TTTTTAACCT 
20101 TTATAAAAAC CTTTTCTAAA ACCCTTTTTA TTTTGAACTA AATTTAGATT 
20151 TACTGAAATT GTGAAATCAA TGTGGAGTTC TTGTATACCC TTCTTTCCGC 
20201 TTTTCCTAAT AGTAACATCT TACATACATG GTACATTTGT CCAAATTAAG 
20251 AAATAAACAT TGGTACAGTG TTAACTATAG ACTTAATCTG GTTTCTCTAA 
20301 TTTTTTCACT AATGTTCTTT TTCTGTTCTA GGATCTAATT CAGTATACCA 
20351 TATTGTATTT AGTTGTAGGC CATGTTAGCC ACCTTCAATC TGTGACAGTT 
204 01 TCTCAGTCTT TCCTTCTTTT TCGTTATCTT GACAAGTTTG AAGAGTGCTG 
204 51 ATAGGTATTT TATAGAATGT CCGTCAGTTG TCTGTCAGTT TGTATTTGTC 
20501 TGATGTATTT TTTTTTTTTT TTTTGAGATG GTGTCTCGCT CTGTCGCCTA 
20551 GGCTGGAGTG CAATGGCATG ATCTTGGCTC AATGCAGCCT CCACCTCCGG 
20601 GGTTCAAGTG ACTGTCCTGC CTCAGTCTCC CAAGTAACTG AAACTACAGG 
20651 CATGTGCCAC CACGCCTGGC TAATTTTTTG TATTTTAGTA GAGAAGCAGT 
20701 TTCACCGTGT TGCCCAGGCT GGTCTCGTGC TCCTGAGCTC AGGCAATCCA 
20751 CCCGCATTGG CCTCCCAAAG CGCTAGGATT ACAGGTGTGA GCCACCATGC 
20801 CTGGCCAATA TTTTGAGGGA TATACTTTGG TGAGGTCATG CAGATATCCT 
20851 GTTTCTCCTT AGTTTTATCG ATTAATTTAG CATTTATCCA GTAAATCTTC 

20 901 CTTGCAGCAA TTATTTTTTC TTTTTCTTTT TTCCTTAATT TTTTTTTTAA 
20951 GAGATGGGAT CTCACTCTGT TGCCCAAGTT GGAATGCAGT AGTGAGTTCA 
21001 TAGCTCACTG CAGCCTCAAA CTCCTGGGCT CAAGTGATCC TTCTGCCTCA 
21051 GCCTCTCAAG TAGCTGGGAC TACAGGCATA GACCACCACA CCCAGCTAAT 
21101 TAAAAAAAAT ATTTTTAGAG ATGGGGGTTT TGCTATGTTG CTCAGGCTGG 
21151 TCTTGAACTT GCTGGCCTCA TGTGATCCTT CTACCTCAGC CTTACAAGTA 
21201 GGTGGGAATT ACAGGTGTGA GCCACCACAC CCAGCATTGC AGCAATTATT 
21251 AATGTAGTGC TACTGGTCAT TTTCTGTTTT TCTCATTTCT TCAGCATGTG 
21301 TTATTGACTT GTCTCTTCCC TCCCATTTAT AATCATTTAT ACTGCTATGA 
21351 ATT CAT GAG T ATTTATTTTG TGAGTTATAA TCTAATACGT ACTTAATTTA 
214 01 TTTTGTGCCT CAAATTGTTC TGGCTTGGCC ATTTTTTTTT TTTTTTTTTG 
21451 AGACGGTCTC GCTCTGCTGC CCAGGCTGGA GTGCAGTAGC GCCATCTCTT 
21501 CTCACTGCAA CCTCCACCTC CCGGGTTCAA GCGATTCTCC TGCCTCAGCC 
21551 TCCTGAGTAG CTGGGACTAC AGGCGTGTGC CGCCACACCC GTCTAATTTT 
21601 TTGTATTTTT AGTAGAGACA GGGTTTCACC ATGTTAGCCA GGATGGTCTC 
21651 GATCTCCTGA CCTCGTGATC TGCCCGCCTC AGCCTCCAAA AGTGCTGGGA 
21701 TTACAGGTGT GAGCCACCAA GCCCGACCGG CTCCTGTATC CTTTTAACAT 
21751 GAGGTGCTGT CATCATTTTT TCCCCCTAAT ATTTTGGCCA AAAATGTTAA 
21801 TCAAGGATGG CACAAATTTT CTGTAGCTGT ATCTCACAAT GAAAGAGGCC 
21851 TGATTAAAAA TGTAAAACTA AAATGTTCTC TGATCTCTTA GCACATGCTT 
21901 TGTAAAAGGC ACAGTGCTAG ATCCTTGTAT ACGTAGATGA GTAAGTCAGC 
21951 TTACCTTCCA CACCCACAGA TAGCTATGTC AAACGTAAGG GTGGAGAAAC 
22001 ACAGACCCCA AACTTCTCGA GGGTAGAAAA TATGAGGTTA TAGTAGATTA 
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22051 GAACTACAAA AAGCTAGAGG 
22101 TTACTAGAAT AATTTACGAG 
22151 TTTTTTTCCT TTCTCTTTTT 
22201 CCCAGGCTGG AGTGCAATGG 
22251 CCTGGGTCCA GGTGATTCTC 
22301 CAGGCATCTG CCACCATGCT 
22351 GAGACGGGGT TTCACCATGT 
22401 AGGTAATCCA CCCACCTTGG 
224 51 GCCACCGCGC CCAGCCAAAT 
22501 GTTTTTTTCA CTTAAGTCAA 
22551 GATCCAAATT CATGAGGAAT 
22601 TTGCTAAATT AGTCTTGGCT 
22651 GTAATTTTAT ATTTGTATAT 
227 01 TTGTAAATTA TAAAAACGTT 
22751 CGAATATTCA GTATATTTAC 
22801 TAATTTAAAA TGTCCCAATG 
22851 AGGTGTGTGT CTTTGATAAG 
22901 TATTTGCCTT TCTCATGTGA 
22951 CGAGAGAACC AGTAGTCTTT 
23001 TTCTTCATAT TATTTATAAT 
23051 ACTCATAAAT AATTTTTTTA 
23101 ATTGTCCAGG CTGAAGTACA 
23151 GCCTCTCGGG TTCAAGTGAT 
23201 ATTACAGGCA TGCGCTACCA 
23251 ACAGGATTGC ACCATGTTGG 
23301 TGATCCACCT GCTTCGGCCT 
23351 GCCACTGTGC CCAGCCATAA 
23401 AACTTAAAAA AATGTAGTGG 
23451 ATTAATTTCT TGAAACCATA 
23501 CATGTTTCTT TCTTTCTTTC 
2 3551 TCTTGTTGCC TAGGCTGGAG 
23601 CTCCTGGGTT CAAGCAATTC 
23651 TACAGGCGCC TGCCACCACA 
23701 CGGGGTTTCA TCGTGTTGGC 
23751 GATCCACTGC ACCTGGCCCC 
23801 TCTGAAATAG AGTTGTTGAT 
23851 CCCGTGCTGG AGTGCAGTGG 
23901 CCTGAGTTCA AGCAATTCTC 
23951 AAGCTGCCCA CCACCATGCC 
24 001 GGGTTTCACC GTATTGGCCA 
24 051 GAGCCACTAC GCCTAGCCTG 
24101 CAACATCTGA GGCCCAGTAC 
24151 CAAATTCAGG GAATGTTTTT 
24201 CAGGGAGATT GGTACATTGA 
24251 AAAACCTCCC TACTAAGTCC 
24301 CACTTGAACA CCTGGAGGCA 
24351 GCACTCCAGC CTGGGCGACA 
244 01 CCCCCCAAAA AACCAACCTA 
244 51 CCTTTCTTCA TACAGGTAAA 
24501 GTGTTCAGTA TTCTTAAGAG 
24551 TGTACTCCCA GCACTTTGGG 
24 601 GGAGCTTGAG AACAGCCTCC 
24 651 ATACAAAAAT TAGCTGGGTG 
24701 TCAGAGGCTG AGGCAGGAGA 
24751 GTGACCTGAG ATTGCTTCAC 
24 801 TCTTGTCTCA AAACAAACAA 
24851 GTGGCTCACA CCTGTAGTCC 
24 901 CACTTGAGAT CAGGAGTTGG 
24 951 GTCTCTACTA AAAGTACAAA 
25001 TCTCAAAAAA AGAAAGAAGA 
25051 CAATAGTAGT AAAGTCAGCC 
25101 TTAGGTATCA ATCATTTATT 
25151 TAAAAATTTT AAAATCTCAA 



AAGTTCTGAA CTGGAAACAG TGGATAGGAT 
GGTGACAATT GTAAATCTTC ATAGGTTTCT 
TTTTTTTTGA GATGGAGTCT CGCTCTGTTG 
CGCAGTCTCT CCTCACTGCA ACCTCCGCCT 
CTGCCTTAGC CACCCAAGTA GCTGGGATTA 
GAGCTAATTT TTGTATTTTT TTTTTTAGTA 
TGGTCAGGCT GGTCTTGAAC TCCTGACCTC 
CCTCCCAAAG TGCTGGGATT ACAGGTGTGA 
TTTTATTGGT TTCTAAACTA GCGTAATTTA 
AATTATATTA TTGTAGGATA AAAACTTAGT 
GAAGAATAAA TACATTTAAA GTCTTACCAT 
CTTTGTACCA AAATTCTGTC CTTGTGCTCT 
TTTCTATCAA CATTTTTACT GTGTGGTGTT 
TTAAAGCAAA CTCAGAACAA TGAATTCTCA 
AGTTGAGAAA TAAACTACTT CTGTAGTAGG 
CAAGTTAACG TGTCACTGAT CACGCTATTC 
GGGAGGTGGG GAAGTTTGTG GGTTTGATTT 
CTGTTGTCAT GTTAGTAAAC AAATGGTTTG 
TGCAAAGATT GTCTTATACA GAGCACTCAA 
GGCTTTAATT TAAGCCTTAA ATT ATT AG AA 
TTTGTTTTTT TGAGATGGAG TTTCGCCCTT 
AT GAT GT GAT CTTGACTCAC TGCAACCTCC 
TCTCCTGCCT TTGCCTCCCA AGTAGCTGGG 
TGCCTGGCTA ATTTTGTATT TTTAGTAAAG 
CCAGGCTGGT CTCGAACTCC CAACCTCAGG 
CCCAGAGTGC TGGGATTACA GGCTCACTGA 
TGCGTTAAAA TAAGAGTGTT ATATTTGTAA 
TTGAAAAAGG TAATTTAAAA AGAATTGACT 
ATGTAACTTG TAGTGCAATT AGGAAACCTT 
TTTTTTTTTT TTTTTGAGAT GGAGTTTTGC 
TGTGTGATGT CAGCGCACTG CAACCTCTGC 
TCCTGCCTCA GCCTCCCGAG TAGCTGGGAT 
CCCAGCTAAT TTTTGTATTT TTAGTAGAGG 
CTGGCTGGTC TCGAACTCCT GACCTCAGGT 
CGTTCATGTC TTTTAAAGCT TTATGGTTGC 
TTTTTTTTTT TTTTTGAGAC TCCTCTTTTG 
TGTGATCTGA GCTCACTGCA ACCTCCACCT 
ATGGGTCAGC CTCTCAAGTA GCTGAGATTA 
TAGCTAATTT TAGTATTTTT AGTAGAGATG 
GGGTGGTCTG GAACTTCTGA CCTCAGGCAT 
GGTTGTTGAT CTTTAAGGTG ATACTTCAGG 
AGTCCTTTAC TTCAACTGGC TCCAGTACAG 
GAGTGTTTAC TGGATGCCTG GCGTGGAGTT 
GTCCAGTTGT TGTGTTGAAA CTTCTGTTTA 
CAGCTACTCA GGAGGCTGAG GCCTGAGAAT 
GAGGTTGCAG TGAATCGAGA TCGAGCCACT 
GAGTGAGACT GTCTAACAAC AAAAACAACA 
CTATGGTAGT ATCAATGCTG TGATAGTCTT 
TTCTTAACAT AT ACT CAT TG TTAATGTTCA 
TATTTGGGGC CAGGCACGGT GGCTCATGCC 
AGGCTGAGGT G AG C AG AT T A CCTGAGGTTA 
AACATGATGA AACTCCCGTC TTTACTAGAA 
TGTTAGCACA TGTCTGTAAT CCCAGCTACT 
ATTGCTTGAA CCTGGGAGGT GGAGGCTGCA 
TGCACTCCAG CCTGGGCAAC AGAGCGAGAC 
ACAAAAAAAG AATATTTGGG GCCAGGCATG 
CAGCACTTTG GGAGGCCAAG GTGGGTGGAT 
AGACCAGCCC GACCAACATG GCTAAATCCC 
AATTAGCTTG AGCAACAGAG CAAGACTCTG 
ATATTTGGTT TAATTAAGAA GGAACCTTAT 
AGCTGAACTG CCAAGTACAA ATTGTTGGTA 
AAGGATAATA TTCTACAATA GCGATCTTTT 
ACTGGAAAGG ATGTCTAGTT CATTCTATGC 



FIGURE 3, page 8 of 21 



25201 TTCAGTCCCC TCTTCTGATT 
25251 CTGACTTCTA TTTTGCTGCT 
25301 TTTCCTCATA TATAATTAAA 
25351 GTAATTTGCT AAAACCCAAA 
254 01 AAACCTACGT AATAGTTGAA 
254 51 TTATATCCTC TCTTCTTTGG 
25501 TATGTCTTGT TTACATGTAA 
25551 TCACTTTAGC ATATTTTTGA 
25601 CATGTATATC TTCTACAAAA 
25651 TTTGAAATTG CTTAAAAGAA 
25701 ATCTATGAGG GATTTCCTTG 
25751 AGTTCCATGA AGGAATCTCT 
25801 GTTAGAGATA CATAAAGAGA 
25851 ACACCTGTAA TTCCAGCACT 
25901 TCAGGAGATC GAGACCATCC 
25951 AAAATACAAA AATTAGCTGG 
2 6001 ACTCGGGAGG CTGAGGCAGG 
2 6051 GCAGTGAGCC GAGATCGCGC 
2 6101 GACTCCGTCT CAAAAAAAAA 
2 6151 CTTTAATACA AGTGTAGAGT 
26201 CCTGTGAATT TATCACATTT 
2 6251 TATGAACATT GTTTTGACGG 
26301 CGTGAGAACC CCTTCATTGG 
26351 ATTTTTTTTT TTTTTTTTTT 
2 6401 TCTTATTGCC CAGGCTGGAG 
26451 TGCAACCTCT GCCTCCTGGG 
2 6501 GGTAGCTGGG ATTACAGGCA 
26551 TTTTAATGGA GACGGGGTTT 
2 6601 CTGACCTCAA GTGAACTGCC 
26651 AGGTGTGAGC CACTGCGCCT 
26701 AATATTTCAT GGGCAAAAAC 
2 6751 CTATAGAGGA GTGGGATTTA 
26801 CAAGAGTGCG TAT C ACT TAT 
26851 AGTTCACTGA TGTTAGAGAT 
2 6901 TTTTATCATG TTTAGTGATC 
26951 TGGCTTAAGC CATGCATGGG 
27001 TTGTTATTTG TAGCTGTGTT 
27051 TGTGCTTTTT ATTAACATCA 
27101 TTCAAGTTAC TTCTGATTGG 
27151 TCTTAGGTTT GCAGTAAGTT 
27201 CAATTAATGC TATGTATGTT 
27251 TCAGAATAAG TTAATTCTTC 
27301 TACATCCCCA GTGGTATCTT 
27351 TTAAAAGAAA TTCAGTGAAG 
27401 TTTGGTCAGA GTCAAACAGC 
27451 TAGTTGGTGG CCTGTGTTGA 
27501 TAAACTTTAG TAGAATAAAA 
27551 TCATGCCTGC AGTCTCAGCT 
27601 TGAGTCTAGG AGTTTTGAGA 
27651 TCTGAAAAAA AAAGTTAATA 
27701 GTTGGCCAAT GGCATAGTTC 
27751 TTGACTTAAC AGAATCCACA 
27801 TAAATTTAGC ATGTTTCCTG 
27851 CAGCACTTTG GGAGGCCGAG 
27 901 ACCATCCTGG CTAACACGGT 
27 951 ATCAGCTGGG TGTGGTGCCA 
28001 TGAGGCAGGA GAATCGCTTG 
28051 AGATGGTGCC ACTGCACTCC 
28101 AAAAAAAGAA AAAAAATAAA 
28151 ATCATTGTTT CTCAGAGCAT 
28201 CCTCTCAGGT AG CC CAT GAG 
28251 AGTATTTGTA AGGAAAT AT T 
283 01 TTTGCGTTTT TCAGTGATGA 



TACTTGTTTA GAAGATTTTT GTTTCCTTCT 
GACTGGCACT TGGGATTTTT AAAAAATTAT 
GACAATAAGT ATAACAATAA GTATAATATG 
CAATGTTTTA AGTAATGCAT ATCATTATGT 
TATTCACAAA GATAATCGCT TATAGAAGTT 
CAGTGCAATT AAAACAAAAA AAATAAGTTT 
ATAATTTTAA TCTAAATTGT GACGTGGTTT 
AAGTAAATCA AAAAGGACAA AATACAAAAT 
ACGATATATA AATTCTAAGG TTTTTGTCCT 
TGCATAGAAC TGGTGTCTGA GTTGGGAAGG 
GAGACCGTGG GTGAATAATA ATGTTGTCTT 
GGGGATAGTT TTTGAGTTAG GCCTGGCAAT 
GCCTTGTTTT ATCACTGGGT GCGGTGGCTC 
TTGGGAGGCT GAGGCGGGCA GATCATGAGG 
TGGCCAACAC GGTGAAACCC GTGTCTACTA 
GCGTGGTGGC GCATGCCTAT AATCCCAGCT 
AGAATCACTT GAACCAGGGA GTTGGAGGTT 
CACTGCACTC CAGCCTGGGT GACAGAGCAA 
AAGCTTGGTT TTCAATGGTT CTGAAAAATG 
GTTAGTCAAG TTTTGCACTT GGATAAACAG 
CTAGTTTATA ATATGGGCTT TCAGAAGTTA 
GAGAATTCAA GCTGGATGCT AGAGAAGGAT 
AGGAGTGCTA TGAAATTATT TGATCTTGGA 
TTTTTTTTTT TTTTTGAGAC AGAGTTTCGT 
CTGGAATGCA GTGGCACGAT CTCGGCTCAC 
TTCAAGCAAT TCTTCTGCCT CAGCCTACCA 
TGCGCAACCA TGCCCAGCTA ATTTTTGTAT 
CACCATGTTG GTCAGGCTGG TCTTGAACTC 
TGCCTCAGCC TCCCAAAGTG TTGGGATTAC 
GGCCTGATCT TAGAATTTGA AGGAGAGACT 
AATGAAAAGT TACCTTTCTG TATTCTAATA 
TTTAGAATGT TTTAAGTATC TTGGGCAGTC 
TTTTCTTTTC CTTCTTTCTT TTTAAGTGGA 
CATAGGTGGC ATTGCCTACT TTTTACATAA 
TGTCAGAAGG GCTGTGGCTG TTTGCAGTTT 
CTTTATAGGA GATGTAGTCT TCACAGTGAG 
TTTGTTTTTG TATAGCTTAT AGCAATGCAG 
TTTTCTTTTT CTTTTTGCAG TGATTATTTA 
CGACTCAGGG GTTGGAAAGT CTTGCCTTCT 
GAAATTGAAA TGTCTTTACA ATTAATGGTA 
TTCTAGGTAG ATAAAATTAA ACAGTTTTAT 
CAGAATTTAT AT AT T T AAAG ACTCCAAATA 
GGACTGTTAA ATAGAAAAAT ATTGTTGCTC 
TCTGGTTATA AAGTCAGAAT GTCTAATACT 
AGTTCCAATA TAGGCAGCAA GTTAAAGGGG 
AAGCGACTTG ATGAAAATAA ATCTTTAAAT 
AGAAAAAGCA GAGCCAGGTG ACGCAGTGGA 
ACTCAGGGTG CTGAGGGTGG AAGGATCACT 
CCAACCTGGA CAACATAGCA TGACTCTGTC 
AAAGAAAAAG TAGGGTCTTG GACAAACTTC 
TAAATGCTGA AGCTGACAGA TAAAGGACTT 
GTGTCCTTCA TAGTCTTTAT CAACTACCTT 
GCCAGGTGCG GTGGCTCACG CCTGTAATCC 
ACGGGCGGAT CACAAGGTCA AGAGATTGAG 
GAAACCCCGT CTCTACTAAA AATACAAAAA 
CACGCCTGTA GTCCCAGCTA CTCGGGAGGC 
AACCCAGGAG GCGGAGGTTG CAGTGAGCTG 
AGCCTGGCAA CAGAGCAAGA CTGTCTCAAA 
AAAACAAATT AGCATGTTTC CCTTCTAGAG 
GGACCAAAGA CTCCTGGGGG TTACCAAGAC 
GTCAAAATAT CCTAATAATA CTAAGATGTT 
TACTTGGTAA TAATACTAAT AT AAAAG AT G 
CATTGGCTCT GGTACAAAAG CATGTGGGTA 
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28351 AAATTGCTGC TGGCTTGGTA CACATCAAGG CAGCGCTAAG CTCCAAATTG 

28401 TACTCATGGT GATGGCATTC TTTACCTCTG TGCCCTCACA GGAACAAAAA 

28451 CAAGCCGTGC CATTTTTATT GAAGATTGTC CTTGACAAAA CAGTTAAAAT 

28501 GATTAATTTT TGAAAAATGT TGATCCATGA GTATTCCTTT AAAAATATTT 

28551 GTGAAGAAAT GGGAAGTTCA CATAAAACAA TGTTTTTTTT TTGTTTTTTT 

28601 TTTTTTTTTT TTTTGAGACA GATTCTGGCT GTGTTGCCAA GGCTAGAGTG 

28651 CAGTGGCGTC TGGCTCCCAG GCTCAAGCTG TTCTCCCACT TCAGCCTCCC 

28701 AAGTGGCTGG GACCTCCCAA GTGGATGCGC CATCATGCCT GGCTGATTTT 

28751 TGTATTTTTT TGTAGTGACA AGGTCTCACT GTGTTGCACA GGCTGGTCTC 

28801 AAACTTCTGA GCTCAAGCGA TGCATGTGCC TCAGCCTCCC AAAGTGCTGG 

28851 AGAAAGCACT TTTTACTGCA TACTGGCTAG TGTGTTGGTT ATTTTGGAGA 

2 8901 AAAGAAAAGC ATTTGTAGTT TTTTGAGTTG TAAGCTGAGC TAACTGCTTT 

28951 ATTTTTTTCT GTGGAACACC ATTTCTTTTT TTTTTTTTGA GATGGAATAT 

29001 TGCTTTGTTG CCCAGGCTGG AGTGCAGTGG CACAATCTCG GCTCACTGCA 

29051 ACCTCCGCTT CTCGGGTTCA AGCAATTCTT CTGCCGTAGC CTCCCAAGTA 

29101 GCTGGGATTA TAGGCACCTG CCACCAAGCC CAGCTAGTTT TTGTATTTTT 

29151 AGTAGAGATG GGGTTTCACC ATGTTGGCCA GGCTGGTCTC GAACTCCTGA 

29201 CTTCGTGATC CGCTTGTCTC AGCCTCCCAA AGTGCTGGGA TTACAGGCGT 

29251 GAACTACTGC ACCTGGACAT TTTTTTTTTT TTTTTAACTT GAAAGAACAG 

29301 CTAACAGACA GATTAGAACA GAATTGGCTA TTTGACAGAT TTTCTCAGAT 

29351 GAACTGTGAT AGTCATTTCA AGGGAAGTAG CTGCAAGCAT TTGTTGGCTG 

29401 AAATAAAATT TAAGTTTATC ATGGAAAATT AGAATTTGAA AAAACTTAGA 

29451 GTTTACCACT TGACAGTATC CTAAATACAT ATGACTTTTC TGATGAGTGC 

29501 CGATATTAAT GAAGGTTATT TAAAAAATAT TAAATAATGT ATAATTCTTT 

2 9551 TTATATAACA GTTAAAAATA AAACCATGAG TACTAGAATA AAACATAGGT 

2 9601 GGCTCTTTAA TCTTGGTTTG TGAAGGTATT TTTTAAAATA AGAAAAAAGC 

2 9651 AAGAAAT C AC TGCTAAATTT GACTATTAAA ATTAATTTAT CACAGGCACA 

2 9701 AAAATGTTAG AAAACTAATG GCAATAGCAA AT AT AT AT AT ATGAGGATTG 

29751 GTATTCTCAA CATATAAAGC ACATTTGCAC ATCAACAAGA AAAGAATATT 

29801 TCTCCTAATG GAAATAGTGG CAAATACATG AGCAGTCAGT TGAAAAAAGA 

2 9851 AGTAATACAA ATTGCTGGCT GGGTGTGGGT GGGGTCACGC CTGTAATCCC 

29901 AG CAT T TAG A GGCTGAGGCT GGCGGATCAT CTGAGGTCAG GAGTTCGAGA 

2 9951 CCAGCCTGAC CAACATGGAG AAACCCTGTC TCTACTAAAA ATACAAAATT 

30001 AGCCGGATGT GGTGGCGCAT GCCTGTAATC CCAGCTACTT GGGAGGCTGA 

30051 GGCAGGAGAA TTGCTTGAAC CCAGGAGGCG GAGGTTGTGG TGAGTCGAGA 

30101 TCGCACCATT GCACTCCAGC CTGGGCAACA AGAGCGAAAC TCCATCTCAA 

30151 AAAAAAAAAA AAAAAAAAAA AAAAGGAAGT AATACAAATT GCCAATAAAT 

302 01 ATGGAAAAAA AAAAAGGCTC AACTTTATTT GTAATTAAAG GCCTTTAAGT 

30251 TAAACTTAGG TGTCATTTAA TTTTTATTAA ATTGGCAAAT ATTAAAATTA 

30301 AGCATAATTC TTAAGCAACT CTCGGTAGGT GGGAAGAATC TAGCTGTAGC 

30351 CTCAGGTGTT TGTGCCTCAA GGAAAACCCT CTCTGGGATG TCCATTGCTT 

30401 GAAGTCAAAG GTTTTCCAAT AATACCTGGA AACTATTTTT AAAATGCTGA 

30451 TCCCCATACC CTCAAAATAT TAATAGAGAC AATCGTGAGG ACTATAATAA 

30501 AGAAATGTGC AATAAGCTCT GGGGGCACAG AGGGAAGAAT CTATTGGCTG 

30551 AGGAGTTGAA GAAATTGTTT GGACACTCAG TATTGCCTGA GCTCAAAACT 

30601 GAAGGATGAA TAAATGCCAC ATGACCTTGG GGCTGGGGAG TAAGTAGGGT 

30651 TAT GC AG AG A GAGATAACTG AGGCTTTTGG GCAGACGAAT AGTAACGGCT 

30701 CAGGCATGGG AGTAAAGGTC ATTTAGAGAT TTACAAGAAT TCAGCATTTC 

30751 TTTCTTTTTC TTTTTTTTTT TTGAGATGGA GTCTAGCTCT GTCATCCAGG 

30801 CTGGAGTACA GTGGCATGAT CTCAGCTCAC TATAACTCCC ACCTCCCGGG 

30851 TTCAAGTGAT TCTCATGCCT CAGCCTCCCG AGTAGCTGGT ATTACAGGCG 

30901 TGTACTACTG TGCCTGGCTA ATTTTTGTAT TTTTAGTAGA GATGGGGTTT 

30951 CACCATGTTG GTCAGGCTGG TCTCCAACTG CTGAGCTCAA GTGATATGTG 

31001 CACCTCTGCT CCCCAAAGTG CTGGGATTAC AGGCGTGAGC CACTGTACCC 

31051 GGCCAAGAAT TCAGTATTTC TATCCAAGTA CCTGGGGGAT AGATGTGCTA 

31101 CATGAATATT TATTGCATTC ATTTTGTTCT CTGCATTTTT TTTTTTTTTT 

31151 TTGGTTTGAG ATGGAGTCTC GCTCTGTCGC CCAGGCTGGA GTGCAGTCGT 

31201 GCAATCTCGG CTCACTGCAG CCTCCACCTC ATGGGTTCAA GCGATTCTCC 

31251 ATCTTGGTCT CCTGACTAGC TAGGTTTACA GGCGTGTGCC ATCACACCCA 

31301 CTAATTTTTT GTATTTTTAG TAGAGACAGG GTTTCACCAT GTTGGCCAGG 

31351 CTGGTCTTGA ACTCCTGATC TAAAGTGAGC CTCCCACCTT GGCCTCCCAA 

314 01 AGTGCTGGGA TTACATATGT GAGCCACTGC GCCTGGCCTC TATATACTTC 

31451 TATAGTACCT GATACTTATT AGGCACTCAA TTACAACATA ACTTTTTTTT 
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31501 TTTTTTTTTT TTTTGAGACA 
31551 CAGTGGCACA GTCTCGGCTC 
31601 ATTCTCCTTC CTCAGCCTCC 
31651 CACGTCCAGC TAATTTTTTG 
31701 TTGGCCAGGC TGATCTCAAA 
31751 CTCCCAAAGT GCTGGTATTA 
31801 TTCTAAAAAC ATTTTCTTAT 
31851 ATACATTTCC ATTTGGTTGT 
31901 TATACCTTAT CTTTTTTATT 
31951 AGAAACTGAA GCACTTGTCC 
32001 TTTTGTTTCC TTGTGATGTC 
32051 TAATTAGAAG ACCTAGAACT 
32101 TGTAGCTATT TAAATTTAAA 
32151 ACACCTGTAA TCCCAGCACT 
32201 GTGCAGGAGT TCAAGGCTTC 
32251 GACATCAAGA CCCTGTCCCT 
32301 AAACTTAGTT TCCTTGTTTC 
32351 TTGGCTAGCA CAGCTTATAG 
32401 TGGCAGTGCC GTTCTCATTA 
32451 GACAGAGTAC TTCTTAGGTG 
32501 GTCTGTTCAT TCATATATCT 
32551 AATAAAATTT CATGGCAGGT 
32601 TTAACATTAG GTTAAAAAAT 
32651 TTGTACACAA ATGTTCATGG 
32701 AACAACCCAA ATCAATTTAT 
32751 CATGTAATAT TATTCAGCCA 
32801 CTGTAATCCC AGCACTTTGA 
32851 AGGAGTTTGA GACCAGCCTG 
32 901 ACGTACAAAA ATTAGGCAGG 
32 951 CTCAGGTGGC TGAGTCATGA 
33001 GCAGTGAGCT G AG AT CAT G A 
33051 CATCCTGCCT CAAAAAAAAA 
33101 GTACAACATG GATGAACCTT 
33151 ACACAGAGGG CCACATATTT 
33201 TTGGCAAATC TAAAGAGAAA 
33251 GGGTTCTTTC TGGAGTGAAG 
33301 GCAACCTTGT GAATGTATAA 
33351 GACTTTTATG TTATGTGCAT 
33401 GCAAGATTGA CTTCTAAGAA 
334 51 AATAAATTGG GTGGGTGGGT 
33501 AAATTATTCA TAATCATTGA 
33551 GTAAAGAATT TGGCAAGAGA 
33601 TGTAGAGATC TTAAGGAAAG 
33651 GAATATGAAG TAAGTACTGA 
33701 TATACTTTAT ACATTTCATT 
33751 TTTTTTTTTA TTTACTCTTG 
33801 TGCATCTGCT CCACTGAACA 
33851 CAGGATGATA CATATACAGA 
33901 CAAAATAAGA ACTATAGAGT 
33951 TAAGTGACTT GGCTAGTAAT 
34001 ATGTATTGTT ATTTTGTATA 
34 051 AATGTCTTAA GATTCATGTA 
34101 TTAGATTAAA AATAATATTT 
34151 TGTTCATGAA GCAGAGAAAC 
34201 TTTTTCTTTT ATCAGTGGAA 
34251 TGCAAATGTT ATAGACCATT 
34301 GGAACATTAA AGGAACCCTA 
34351 AAGAGAAGTA CTTATGTAGT 
344 01 AACTCTGTAG AAACACAGAA 
34 4 51 AACATGTTCA TTTTACTGCT 
34 501 CTTGCCTACA GTTACTGGTA 
34551 ATTTTGTGTC TGTGTGCCAT 
34 601 AGCCCCCAAT TAAATATAGG 



GAGACATGCC TTGTCGCCTG GGCTGGAGTG 
ACTGCAACCT TCACCTCCCG GGTTCAAGTG 
CGGGTAGCTG GGATTACAGG CGCCCGCCAC 
TATTTTTAAT AGAGATGAGG TTTCACCATC 
CTCCTGACCT TGTGATCCAC TCACCTTGGC 
CAGGTGTGAG CCATCATGCC CGGCCCATAT 
AAAATGACAT TGCCATTATC AACCTGCAAA 
TTTCTTGCTT AGTCTTTTAA TCTAGAGTTT 
TATATATTTT TTATGTCATT GACTTTTTGC 
TGTAGATTGT CCAATATTCT AGATTTGTCA 
CTTATGCTTA TTTGTTTGTC CCTCTTTCTG 
GCACTATCCT TAGAGTAGCT ACTAGCTCTA 
TTAATTAAAA TTGAAAAAGT TTGGTGGCTC 
TTGGGAGGCC AAGGTGGGAG GATTGCTTGA 
AGTAAGCTAC GATTGTACTC TAGCCTGGGA 
TTAAGGGGGA AAAATAATTG AAAAAATCAA 
ACAAGCTGCA TAGGGCTAAT GGCTACCATA 
AACCTTTCCA TTGTCACAGA AAGTTCTGTT 
GACCTGATTC GATTAAGGTC CATCTTTGTT 
GTGCTTTGTG GTTCATATGA TGATAGCCTG 
TTTCACGAGA AATATTTTTA TTCCATTCTG 
ACTTGCAAGA AGCAGTTATA ATTTTAAAGT 
TGACAGGAAA CAT AT ATT C A CAGGTAAAAC 
CAGCATTATT CATAATAGCC AAGAAGTGGA 
GAATGGATAA AATGTTGTAT ATTTGTAGTA 
ATAAAATGGG CCAGGCATGG TGGCTCACAC 
GAGGCTCAGG CAGGGGGATC ACTAGAGGTC 
AC CAT CAT CA CGAAACCCTG TCTCTACTAA 
CGTGGTGATG CACGCCTGTA GTCCCTACTA 
GGATTGCTTG GACCCCGGGA GACAGAGGTT 
CACTGCACTC CAGCATGGGC AACAGAGCAA 
AAAAAAAAAA AAAAGAAGTA CTGTTACATG 
GAAAACATTC TGCTAAATGA AGGAAGACAG 
TATGATTCCA TTTATACGAA ATGTCCAAAA 
GTAGATTAGT GGTTGCCAGG GAGTGAAGAC 
AAAATGTCCT GGAATTCGTG GTTGTAGTTT 
GG ACC ACT G A ATTGTCCACT TCAAAAGGGT 
TATATCTAAA AAAAAAATCA TAATTAGGAA 
AAAGCGGAGT GAAATTGTTG TTTTGTGGTG 
CGCAAGAGTT TTGCTGATTA GTGATTAGAA 
AAATATAAAA TATTTTTCTA TAT GAT GT AT 
TGATGTTTGG AAAAAATAAA GAATGGCTAT 
AAACTACAGT TAAGTAGTGC TTTGTAATCA 
AAGTGGATGG AGTGGCTGTT GTCAGCATGT 
CATAAATTTG GACTGTAGAT AAAAGTAAAC 
AACAACAGTT TTTTTTTTTC CACTTAGACT 
ATACATTTAA TTGTTAATTA TTTCCCCCTT 
AAGCTACATC AGCACAATTG GTGTGGATTT 
TAGACGGGAA AACAATCAAG CTTCAAATAG 
TTTTTTGAAA TTTATTTTGG TAAATTTGTA 
TAT TT ACT AT GCTAACAAAA TTGAATGTAA 
CTTAAGATAG AATGGTAGAA TAAGAATTAC 
TCAAGATTAC TTAAGCCTCA TTGAATTTTC 
TCATGTTTTA AGTCAAACTT GGTCCTCATC 
ATCTAAGTTC AAGTTTACCT TGTCCTACAC 
TTTGTTTGTC TTTTACTGTG CTAAGTGCAT 
GGAAGAGATT CTTCATATGT GGCTCAGTTG 
TCTAAGTATT TTTATTAGAT AGTGTGCACC 
TTTTGTTGGA AAAAGGAACT TAGTTTTTGT 
CAAAAAAACG AATGCTGAAA GATTTAATGA 
GAACCAAGTG ACCGAAGCTC TGTCTTCAAT 
CCTATCCCCC TTATCCATCT TTACACCCCC 
CAATTATAAT AGTTCAGTTG TGCCTCTTCA 
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34 651 GTATGGGTCT GAGTCCTGTC 
34701 ATAAATCTCT CAGTATTTTT 
34751 TAAACACAGG AAACCAGTGA 
34801 GGAATTGGAA GGAGTCCTGC 
34 851 CTTGGTTCTA GTATTTACCT 
34 901 GAAAAGGCCG AAGTAACCCT 
34 951 GATGACACCT GAT T TACT AA 
35001 ATAGCTTTGT GTATGTGACA 
35051 GGCAGCATAT AGATACTACT 
35101 TTTGATAGGT TAAGTTGCTC 
35151 GAAGGTATGT TAACAAAAGA 
35201 CACACTTGAT GTGTCAAATT 
35251 AGCTTGAAAT ACTTTCAATC 
35301 CACGATATAA GTACAGTCTC 
35351 TAAAGATCTA CCCAAAACTG 
354 01 ATTATGTGTG ATCATTGGTG 
35451 GAAACATTCA TATTTTGTTA 
35501 ACACTGTCTT TCATCTTTGT 
35551 TCCATGCCTG AAGGCCTCAT 
35601 TTTTTGAGAT ACAGTATTGC 
35651 GATTTGAGCT CACTGCAACC 
357 01 CCTCAGCCTC CCTAATAGCT 
35751 ACAATTTTTT TTTTTTTGAG 
35801 GTGCCAGTGG TGCAATCTTG 
35851 AGCAGTTCTC TTGCCTCAGC 
35901 CCACCACACC CTGCTAATTT 
35951 CATGTTGGTT AGGCTGGTCT 
36001 CGGCTTCCCA AAGTGCTGGG 
3 6051 TCCGATAATT TTTGTATTTT 
36101 AGGCTGGTCT CAAACTCCTT 
3 6151 CAAAGTGCTG GGATTACAGG 
3 6201 TTTTTAAGAA GCTTACCATT 
3 6251 TAACTTCAGC TAGGCATGTT 
3 6301 AGGCCGAGGT CAGAGAGTCA 
3 6351 GACCAGCCTG GTCAACATAG 
36401 AAAAAGTAAC CAGGTGTGGT 
3 6451 AGACTGAGGT GGGAGGAATG 
36501 GGCAAGATTG CACCATTGCA 
36551 TGTCTCAAAA ACAAAATAAG 
36601 GTTCACGCTT ATAGTCCTAG 
36651 CTGAGCTCAG GAGGTCTAAG 
36701 CTTTGCAAAA CATACAGAAA 
36751 GCCAGTTATT TTTTTTATTT 
36801 TCTGAGACAA AAATTTCACT 
36851 CTCACTACAT GATTCTTTTA 
36901 AGCCTGTTTC TTTTACTTCT 
36951 TGCCTGAATC AGATTAATTC 
37001 TGATCTTATC ACAACCTTTC 
37051 TAATGATATT TTCAGAACCT 
37101 AAATGCCTTT TATTAAATTT 
37151 CATGAGGTAA ATCCTGTCTT 
37201 AACTTCAAGG GCTCATGGCC 
37251 TGTTGTTTCA GTAGCTATAT 
37301 TTTGCTTGTT TTTTCTTTTT 
37351 CTCTTGTTGC CCAGACTGGG 
37 401 ACCTCCGCCT CCCAGGTTCA 
37451 GCAGGGATTA CAGGCATGTG 
37501 GTAGAGATGG GGTTTCTCCA 
37551 CTCAGGTGAT CCGCCTGCCT 
37 601 TGAGCCACGG CGCCTGGCCG 
37 651 TTGGCCAGGT GCAGTTGTTC 
37 701 GAGGCAGGCA GATCTCTTGA 
37 751 ATGGTGAAAC CCTGTCTCTA 



AGTGTGGGCA TATCTGTGGT CTTTTAAAAA 
CAGAGTAGGC TATTAGCAAG AAGTAGGCTA 
CTGCCCCTTT TCATGGAACT GATGACACAT 
ATTAGGAGTC AGAAGACTTA GATTTGTTGT 
GTTAGAGAAT CATGGGTTTG TGTCTCTGGG 
GAGACCCAGT TTCCTTTCTA AAATGTGTGT 
TTTATAAGCT AGTTGTGAGA ACCAACTGTA 
ATACGTGTGA AAGCCCTTTG TAAACTTTTG 
TAT GAT AT G A CATGCCCAGA TAAATGGGTG 
CCTTTTCTTA CATGACTCTG ATGAGGAAAA 
TAGGTGGCTG TGGATATTGA TATAAGTAAA 
AGGACTTGCA AGGATTTAGT TTTCAGAAAT 
AGTGAACAAA TTACCCTCCA TATTTTTTCC 
AACCTTTTAT TTGGCACCAT AAAGAGCACA 
TACTTTAAAG CACTGGTATG GAATAATTGT 
TTTATAAGAT TTGGGTGTGT ATTCGTGTGT 
CTTTCCTGTG GCTGGAAGGG ATCTTATAGG 
CTGTCTTTCA TCTTTAATAG GAATTTCTTT 
TTTGAACATT TTGTTTGTTT GTTTTTTTAT 
TCTGTCTCCC AGGCTGGAGT GCAGTGGCGC 
TCCGCCTCCT GGGTTCAAGT GATTCTCCTG 
GGGATTACAT GTGTGTACCA CCATGCCCGG 
ATGGAGCCTT GCTTTGTCGC CCAGGCTGGA 
GCTCGCTGCA GCCTCCGCCT CCCAGGTTCA 
CTCCTGAGTA GCTGGGATTA CAGGCGTGCG 
TTTGTATTTT TAGTAGAGAC AGAGTTTCAC 
CGAACTCCTG ACCTCGTGAT CTGCCTGACT 
ATTACAGGCA TGAGCCACTG TGCCCAGCCT 
TCGTAGAGAT GGGATTTCGC CATGTTGGCC 
ACCTCAAGTG ATCCACCCGT CTTGGCCTCC 
CGTGAGCCAC CACGCCTGGG TTTTTGAACA 
TTTTCGAAAT AGCTAGTTCC ATTTTACACA 
GCCTCATGCC TGTAATCCCA GCACTTTGGG 
CTTGAGGCCA GGAGTCAACA TAGCTCCTGT 
AGACTCTATC TCTACCAAAA AAAAAAAAAA 
GGTCCATGCC TGTAGTCCTA GCTCCCCAGG 
TTTGAGCCCA GGACTTCAAG GCTGCAGTGA 
CCCCAGCTTT GGGGACAGAG TGAGAGACCC 
GCTGGGCGCA GTGGCTGTCC GGGCGTCGTG 
CACTTTGGGA GGCCAAGGTG GGCAGATTGC 
ACCAGCCTGA GCAACATGGC GAAACCTCAT 
AAAACAAAAA AAACCACAAA ACCTCTAGTT 
ATTCCTAGTG ATTCTTCTTT TTTTCTTTTT 
TTGTCTCCCT CGCTAGAGTG CAGCGGTCAG 
GAGACATGTT AATTCTTTAT ATTGAGCTGA 
GTCTCTTCTT ATTCCTCCGC CTTGTAGAGC 
CTCTTTTATT GGCAAGCCTG CCCTTCAGAT 
TTCTACCTCT GAAGTCCTCA TTCTTTCCTG 
TGTGCAATTT GGGTTATTCT TACATTTTAT 
GATTTCTTAA ATCAAGTATG AGATATAACA 
GATTTGGAGC CTGAATGAAT TTCTCTCTTG 
CTTTCTTATT ATTAATCAAA GACAACCATT 
TATTTCTAGT TTGGGTCTTA AGGTTTTTGA 
TCTTTTTTTT TTTTTTGAGA CGGAGTTTCG 
AGTGCAATGG CGTGATCTCG GCTCACTGCA 
AGCGATTCTT CTGCCTCAGC CTCCCTAGTA 
CCACCACGCC GGGCTAATTT TGTATTTTTA 
TGTTGGTCAC GCTGGTCTCG AACTCCCGAC 
TGGCCTCCCA AAGTGCTGGG ATTACAGTCG 
ATTTGCTTGT TTTTAATTAA AATAGGGGCC 
ACCCCTGTAA TCCCAGTACT TTGGGAGGCT 
GTTCAGGAGT TCAAGACCAG TATGGGCAAC 
CCAAAAACAC AAAATTCAGC CAGGCATGGT 
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37801 GGTGTGTCCC TGTAGTTCAA 
37851 CTTGAGCCCG GAGATGGAGG 
37 901 CTCTAGCCTG GGCAACAGAG 

37 951 AGGGTCTCAC TTTACACTTC 
38001 GGCCGGGCAC GGTGGCTCAC 
38051 GGCACGCAGA TCACAAGGTC 
38101 TGAAGCCCAT CTCTACTAAA 
38151 AGGTGCCTGT AATCCCAGCT 
38201 GAACCCAGGA GGCGGAGATT 

3 8251 CAGCCTGGTG ACAGAGCGAG 
38301 AAACACAAGA TTTTACTTTT 
38351 GTAATCCTTC AAGTTTTCTA 
38401 TATATTTTAA TTATCTTTTC 
38451 CTGGGTCTTT ATCTGAGTTT 
38501 GGCAGGTACT GAGTCCAGTT 
38551 GTTCATTCAT GAGGCTTTAT 
38601 AGGTTACAGA GATGAATAAA 
38651 GAGACAAACA TGGGAAAAAG 
387 01 ACATATGAAG GAAAGTATTG 
38751 GTTTTCAGAG TAGCAAGGAA 
38801 ATGGTGCATG TATTATTCTG 
38851 ACTGGGTGAT TTATTAAAAA 

38 901 GCTGAGGAGG CCTCAGGAAA 
38 951 TCTTACATGG CAGGAGGCGA 
39001 CCCCTTATGA G AC CATC AG A 
39051 AT GGGGG AAA CCGTCGTCAT 
39101 TTCCCTCAGT GATTACAACT 
39151 CAAACCATAT CAGTGCCTGT 
39201 GGAACACTTG AGCCCAGGAG 
39251 CCTCCATTTT GGATAAAAAG 
39301 GTTTAATCAG CTGTAACTTC 
39351 CCTTCTATCT TTGTGAAAGA 
394 01 GCTTTATTGA GCAGCTTCTA 
39451 ATTTAAACCC CTTTAGCTGT 
39501 AGATGAAGAA ACTGTTAGGT 
39551 CTGGATTTGA TGTTTTGAAG 
39601 TCCTCCAAGG ATACACTGAA 
39651 TTTTTTTGAG ACAGGATCTG 
39701 CTCAGCTCAC TGCAACCTCT 
397 51 CCTCTCGAGT AGCTGGGACC 
39801 TTTGGATTTT TAGTAGAGAC 
39851 CGAACTCCTG AGCTCGTAAT 
39901 ATTACAGGCG TGAGCCACCG 
39951 AACTATAATG CAATAATGTT 
40001 GGTTGAAGTG CAGTAAATTG 
40051 GAGTCTCTTC CTTACATTTT 

4 0101 GGCATGGTGG CTCATGCCAG 
4 0151 GATAATTGCT TGAGTCCAGG 
40201 GACTCTGTCT CTACAAAAAA 
40251 CCTGTAGTCC CAGCTACTCA 
40301 GGGAGGTTGA GGCTGCAGCG 
40351 GATGACAGAA TGAGACGCTG 
404 01 GGGATGATTA AAGTTCTGTC 
40451 CCAGTGCTTT TAATCATGGT 
4 0501 ATGGAAGAGT TCTGTGTTCA 
40551 CAGTGTTAAA GGCGGTAGGG 
40601 TCAACAGCTT GTAATTTTAG 
40651 TACTTCTTTT CTGGACTTTC 
407 01 TGAGTTGAGG ACTAATGTTT 
40751 ATGAAAACTA AACTATTTCC 
40801 TTTAATGGAA TTGTTTGGAT 
4 0851 GCAGGCCAGG AAAGATTTCG 
40901 CCATGGCATC ATAGTTGTGT 



GGTACTCAGG AGGCTGAGGT GGGAGGATTG 
TTGCGGTGAG CCAAGATTGT GCCATTTGCA 
CGAGACCTTG TTTCAAAAAA AAAAAAGAAG 
TGTGACTGGT GTTTTAAAAA TCTAAACACA 
GCCTGTAATC CCAGCACTTT GGGAGGCAGA 
AGGAGTTCGT GACCAGCCTG GCCAGCATGG 
AATACAAAAA AATTAGCTGG GCATGGTGGC 
ACTTGGGAGG CTGAGACAGG GGAATCACTT 
GCAGTGAGCC AAGATTGCGC CATTGCACTC 
ACTCCGTCTG AAAAAAAAAA AAAAAAATCT 
AATCCTATCA TTTCCTCTTG CTTGGCTTCA 
GGTCTTTTCA AAATCTTGAT TCTGTTGATT 
CTTTCAGCTT TTCCTGTTCA GGTGTGACAT 
TATTAGATTA TAAAACATTC AGCAAGATAG 
GTACACCATG GAAGGCCTCT TTCTGTGATT 
GAAAATGTCT ACATTACACC AGGCACTTGG 
ACATAGTCCA TTAGGAGGCA GACAATGGGA 
TTACTCTGAT TATGAGGAGT AATGAGAATT 
TTAGTACTGT TAGGATTTAG TGTCAGGAAA 
ACATCAGAAA TTTTACTCTT TCTGCCAGGC 
TTCTCACACT GCCACAAGGA ACTGACCAAA 
AAAGGTTTAA TTGACTCATA GTTCTGCATG 
CTTACTGTGG CAGAAAGGGA AGCAGGCACG 
GAGAGTGTGA AGGAAGTGAA GGGGGAAGAG 
TCTTGTGAGA ATTCATTCAC TATCACTCGA 
AATCCAATCA CTTCTCCATA ATCCAATCAC 
T GAG AT GAGA TTTGGGTGGG GACACAGAGC 
AGTCCCAGTT ACTTGGAGGC TGAGGCAGGA 
TTCAAGATCT GCCTGGGCAA CATAGCAATA 
GAAATTTTAC TTTTTGGGTG CCATTGCTTA 
TTGTTGACTT TTAGTCAAAA AACAATTTTT 
GGTTGGTGAG CAAGGAAGAA AAGGAAACTT 
TAGTCAGGCA CATTTTACAA ACATTAGTTC 
TGTACAAGGT GAATGCTATC TAG CAT TT AC 
GACTCTCCCT AATATTAAAT AACCAGGAAC 
TCAGGGTAGC TTGATCCTCG AGTTCATGCT 
AGACTTTGAG CCTCTTTTTT TTTTTTTCTC 
GCTCTCTTGC CCAGAGTGCA GTGGTGTGAT 
GCCTCCTGGG CTCAAGCGAT TCTGCCTCAG 
ACAGGCGCAC GCCAGCATAC TTGGCTAATT 
AGGGTTTCAC CATGTTGGTC AGGCTGGTCT 
CCGCCCGTCT CGGCCCCACA AAGTGCTGGG 
ACCCAGTCCC AACAGTTTTT TAAAACCCAG 
AGCATTTGTT TTGGGAGTTT GAGCCTAAAT 
TTCTTAAAAT ACGTTTTATG AAAGTATTTG 
TTTCTCTAGC ATGAAGACAA CACCTAGCCA 
TAATGCCAGC ACTTTGGGAG AAT G AG T TAG 
AATTTGAGAC CAGCCTGGGC AATGTAGCGA 
GAAAAAATTA GCCGGGTGTG GTGGCATGTG 
GGAGGCTCAG GTGGAAGGAT TGCTTGAGGT 
AG C CAT GAT C ATGCCACTGT ACTCAGCCTG 
CTTGAGAGGG GAAAAAAAAG ACACCTGCTT 
TTGACTGGTA GTTATTTGAA TTAGGTCCCT 
AGAATGTGCT AGCAAGTGAG TTTGTCTTAC 
AGGGCTTTCG GCCAGTGGCA TTCCTAAACA 
AATGTGAAAA GTATGACATA GTTCCTGCTC 
TATTATTATC GTAAGCTCAA TTGTAGGTAC 
AGGTGCTTAT TACCGTGCAA TTTAGTGGTA 
CTATATCACA TCCTGATAAT CTCCACAGTT 
CCTCCCTCCT ACACTTTTCC CCAACTTTAT 
TTCTTGATTG TTTTGTAATA GTGGGACACA 
AACAATCACC TCCAGTTATT ACAGAGGAGC 
ATGATGTGAC AGATCAGGTA AGTTCCAAGA 
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4 0951 GGAGATTGTG TTACAGTGAC CAAGTAGGAA GCCATTATTT GATTAATGTC 
41001 AGATTCATTT ACTACTTCAT ATATAAGCCA TCAGTATTAA TTTTATGGCA 
41051 GAAAACTTTG TCCACTCTCA AATATAAATG TGAATCACTT AAAAGACATT 
41101 TGTTTTCCTG TAATAAATAA AAGATTAGTA ATTAGTTTTA CGTTTGCTTT 
41151 CAAGGGATTC TGGTTGTATT TATTGTCAAC TAAATAACTT TGATCAAATA 
41201 GCCAAGACTC TAACATATAG GCAAGAGTTT GTAGGGAATC GTGAGTTGCT 
41251 TGGCTTATAC TGTGTTCTTG GTGTTAAGTA TTAACAGGAA TATGGCCTGG 
41301 TAATTAGAAC TTGTCCATCA GAATTGCCAA AAGTGGGATT CGGGGGTCTC 
41351 TGCCTATGGA GGATGTGGTT CAGAAATAAA GAATTTGAAT AGGATAAGCT 
41401 GTAGGAGGAT CTTAGTATGA GAATGAGTAT CTGAAGATTA GCTGTGAGAG 
41451 AGGGCAGAGC GATGGAGGGA ACAATGTGGG ACAGTGTGAA GCATGTGATC 
41501 CAGGGGCCAT AACTTTTTTT GTTACTATTT TTTTAAATCA GAAACTTAGA 
41551 TTTCAGTGTC CTTTCTATCA AAGAAAAGGA CAAAAGATAA ACGTTCAAAA 
41601 TTGGAATTTA TTTTTCTTTT GGCAAATGTT AAATCTCACC TCTAATGAGA 
41651 AATCATAGCT AATTAGGAGA TAACTTACAT GTAAGCATTT AGATTCAGTG 
41701 CCATTAGAAG TGCTGGGTGG GTGATATCTG CAGGAGAAAA AAATGATGCT 
41751 AGTTTAAAAA ATCTCTACTA TTACCGTGAA ATATTTTTAA ATGAAAACTT 
41801 TCGTCCTCTA AATATGACTG TGGAAAAGAA AATGAGTATA TTTAATAACA 
41851 TCTTTTGACA TCTCTAGTAG TAACAGTAGG TCATCTTATT CATAAACCAA 
41901 AATTTTACCA AATTTCAGGC CAGGCGCAGT GGCTCATGCC TGTAATCCCA 
41951 GAACTTTGGG AGGCCGAGGC GGGCGGATCA CCTGAGGTCA GGAGTTAGAG 
42001 ACTAGCCTCG CCAACATGGC AAAATCCCAT CTCTAGTAAA AATACAAAAA 
42051 TTAGCCAGGC GTGGGGGCCC GTGCCTGTAA TCCTAGCCAC TTGGGAGGCT 
42101 GAG AC AG GAG AATCGCTTGA ACCCAGCGGG CAGAGGTTGC AGTGAGCCGA 
42151 GATCGCGCCA TTGCACTCCA GCCTGGATGA CAGAACAAGA CTTTGTCTCA 
42201 AAAAAAAAAA AAAAAAAAAA AAAAAAATTA ATCAAATTTC AAAACCAGGT 
42251 TTTGTAGTAC ATTTAAATTG CATATTCCAA AGCAGTTGGG TTTGCCTGCG 
42301 TTGCAGTTTA ATATTAAGCT ATACTTCCCT TTCAAATAAG GTATTTTCAT 
4 2351 CGTTAAGCCT GTAAATTCTA GTTTGTCATT GTTTAGATAT TTATAGTCAT 
42401 TTTAATATAT CTGTTTACGG CCAGCTGCAA TGGCTAACAC CTGTAAACTC 
42451 AGCACTTTTT GAGGCCAAGG TGGGCCGATT GAGCTCAGGA GTTCGAGACC 
42501 AGCCTGGGCA ACATAGTGAA ACTCCATCTA TACAAAAAAT CCAAAAAAAA 
42551 AAAGACAGGT GTGGTGGCAT GTGCCTGTAG TCCCAGCTAT CCCGGAGGCG 
42601 GAGGCGGGAG GATGGCTTGA GCTTGGGAGG TCGAGGGTGC AGTGAGCTGT 
4 2 651 GATTGTGCCA CTGCACTCCG GCCTAGGTGA CAGAGCAAGA CCCTGTCTCA 
42701 AAAAAAAAAA TCTCTTCACT CCTTAGCAGT GGTTATTTTG TAGCTAGAGT 
42751 TGTCTCACTA GCTCTTTGTT ATTTGTCTGT TAGGTCAGGA ACGATGTTTC 
42801 TGTTTATTCC AGAACTATAT TAT CGAACTA TATTATCAGT CTTTCAAATG 
42851 TCTTTTTAGG AGTCCTTCAA TAATGTTAAA CAGTGGCTGC AGGAAATAGA 
42 901 TCGTTATGCC AGTGAAAATG TCAACAAATT GTTGGTAGGG AACAAATGTG 
4 2 951 ATCTGACCAC AAAGAAAGTA GTAGACTACA CAACAGCGAA GGTATGTTTA 
43001 AAGTTTAATT TTCATACTGA ATTTGAAGGT GTTGAATTAT GTATGGGTTC 
4 3051 TGCAGTAACA GTAAGGCCAC AGCCTTTTAA AAATATGTGC ACTAGAATAC 
4 3101 TGTGACAGTG ACAATTTGTG TAGCATCTGT TTGGATCCAA TGAACTTAGT 
43151 TCCTCACGCT CCATTATGGA TGGTAGAAAT GCAGTAAGAA TTAGTGAAAA 
4 3201 AGATTTTTCA GTGTTAATTG TGCCTCATTA TTCTCTTAGG AATTTGCTGA 
4 3251 TTCCCTTGGA ATTCCGTTTT TGGAAACCAG TGCTAAGAAT GCAACGAATG 
43301 TAGAACAGTC TTTCATGACG ATGGCAGCTG AGATTAAAAA GCGAATGGGT 
4 3351 CCCGGAGCAA CAGCTGGTGG TGCTGAGAAG TCCAATGTTA AAATTCAGAG 
4 3401 CACTCCAGTC AAGCAGTCAG GTGGAGGTTG CTGCTAAAAT TTGCCTCCAT 
43451 CCTTTTCTCA CAGCAATGAA TTTGCAATCT GAACCCAAGT GAAAAAACAA 
4 3501 AATTGCCTGA ATTGTACTGT ATGTAGCTGC ACTACAACAG ATTCTTACCG 
43551 TCTCCACAAA GGTCAGAGAT TGTAAATGGT CAATACTGAC TTTTTTTTTA 
43601 TTCCCTTGAC TCAAGACAGC TAACTTCATT TTCAGAACTG TTTTAAACCT 
43651 TTGTGTGCTG GTTTATAAAA TAATGTGTGT AATCCTTGTT GCTTTCCTGA 
43701 TACCAGACTG TTTCCCGTGG TTGGTTAGAA TATATTTTGT TTTGATGTTT 
43751 ATATTGGCAT GTTTAGATGT CAGGTTTAGT CTTCTGAAGA TGAAGTTCAG 
4 3801 CCATTTTGTA TCAAACAGCA CAAGCAGTGT CTGTCACTTT CCATGCATAA 
4 3851 AGTTTAGTGA GAT G T TAT AT GTAAGATCTG ATTTGCTAGT TCTTCCTTGT 
43901 AGAGTTATAA ATGGAAAGAT TACACTATCT GATTAATAGT TTCTTCATAC 
4 3951 TCTGCATATA ATTTGTGGCT GCAGAATATT GTAATTTGTT GCACACTATG 
44001 TAACAAAACA ACTGAAGATA TGTTTAATAA ATATTGTACT TATTGGAAGT 
44051 AATATCAAAC TGTATGGTGA TAAGTATTGT TTTGATTCTT ATGGTTAAAG 
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44101 GGAAATAGAG CCTTGCATTA TATTCAACAC AGCCATTTGT GTGTGCACAA 

4 4151 TGCAAACTAA GGTATTCTAG ACCTATCTTA GAGCAGCATC CAGTATTTGC 

4 4 201 TTTCTAGATA ATATGCCCAA TAACATGACC TAGAGGGGCT TCTGTGCTGT 

44251 GTAGGGATTT AACCAACTTC AGTGGTTCAG GGAGCTCAAA CTATATGTAA 

44301 AACAAGTTTA GAATGTATGC TATCTAGCCC GTTATCTCTG ATCCTTCTCT 

44351 AAAACCATTT GAAATAGCTT CATTGATCAA CATTTCATAA ATGCATCTGT 

44 4 01 GGTAGAGGTA GAAAGCAGCA CCTTTCCTAA TTGGCAAATG ATCAGACTAA 

44451 TGTGTGCTAA TGTTTTTCTT CCATGCTTTC AGTCAGATTC AACTATTTTA 

4 4 501 TCCTCCACAG TTGCTTAACT TGGTGTTGGA GGAGGGTTTA AGCATTAAGA 

44551 TAGGAAGCAG GAAATTTGAT TGCTCTAAAT TTAGAAATTA TATCCCTAAA 

44 601 AATTAAAACA TGAATACTGG GTGGTAATGA TAATTGAGGC AAATGTATTT 

4 4 651 ATTTTGGTGA CATTTTGCAT ATATGAAGAT TTTCTGAAAT AGGACCTTCA 

44701 AGATCCTAGG GGGTTTTGTT TGGTTTTTAA TTGTGAGGAA TAAAAAATCT 

4 47 51 TCTGCCCACA CTGGCATTTT AAGGTGACTG AGGTCAAACG TTGTTTCCTT 

44801 AGGTTGAAAT AGCAGCCAAA ACATTCTTCA CGCAGGGGCT TGGGATATGG 

44 851 CTGCTGGCAA CACATTTTGT TGTGGGCTCC TTAATTTAAT GATAAAATTT 

4 4 901 AAGCTAAACA CAAGCCAAAA ATGAATAGGT TTTTTTAATT TTTATTTTTC 

44951 ACTAAACAGG CAATTGAAAT ACATGGTACA AAAATAAGTG GTAAGATAAT 

45001 TGTAAAATGA AATGGACAGA ATATTCAATT TTCCATCTAT GAAAATTTCA 

45051 CAATAAAAAT CATAGTTTAC TTTGTATTAT AGGCGTGCTT GGTGGATCTA 

45101 TTCATCCTCA CATAAGGCAA CTGACAAATT CCTGAAGTTA CCAATAGTTA 

45151 TTTTGGTGAA GATCTTTAAT GCTTCAGAAG TTTTGTTTTT GCCTTAATAC 

45201 AGTATAAAGG GGGAAAGAGT TCAGAAACTA TTTTCTAAAG TAGCTAAATG 

45251 ACACAAAACA AATGTCAAGA TACTGTGATG CCATGCCGTG CACTTCATTT 

45301 TTACACAGTA AAAGTTGTTT AAATTGTCAG CTTATTCTTG GTGAGTTAGC 

45351 GGAAACATTA CATGAACTTA AGATGAGCAT ATTTACAGAC TTAAGTTTGG 

45401 AAAATTCCAG CGTTCTTTTC CCCATGGCAG TAAAGATTGG GATTTACAAC 

45451 AAATTTCAGC ATGCCTTAAG ATTTGCTTCT ATGTATACGC CAATAAATGT 

45501 GGTTCTGGAA AAAAT AT AT A CCCCTTTATA CCCCCATTTT CAAGTACAAA 

4 5551 CGGTTCAAAG CTACTACAGG TTTTAATAAT CTGTTCACTT AGTAAAGGGA 

45601 ATTACCACTT GTTCTAAATA TAAGGTGCTG CCATAAATTA GTTTACATAG 

45651 TGAAGAAGAG TGTTCTTAAA TCTAAGCAGC TGCACACTCT GTGAAATCCT 

45701 TTCAGAATGA TAGTCATTGT GGTCTGAGCA GTAATTTCCT ATTCTTCGAC 

45751 CTTGGATTGA ATTTCCCTTA GCCTACATCT TGCCTTTCCA GCATATCTTA 

45801 CCTCAAACCT TCTTTGTGTT CCATTCCCAC CTAAGCTTCA AAATAGCCCT 

45851 GTGTTGACGT CGTCTTCCAT TTGCTGAGCT TACCTATGGA TCTCCAAGAA 

45901 CCCAGATCTT GAAACTGCTG ATCCAGCTTT GAGTATCATC ACTTCCCTGT 

45951 GGATTTAACT TCCATTAATT TTAAGGGACT ACTAAGTTAT TCCAGTGTGG 

4 6001 CATCACAGTG CAGTTAGCAA GCTCAGCTAC TTGACTCTAA TTTGGCCATG (SEQ ID NO: 3) 
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DNA 

Position 

397 TGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCCTCTCGGCCCACTGTAGCCTCCGCCTCCC 
GGGTTCAAGCAATTTTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACGCGCCA 
CCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGACAGTGTTTCACCATGTTGGCCAGGC 
TGGTCTTGAATTCCTGACCTCGTGATCTGTCCGTTTTGGCCTCTCAAATTCCTGAGATTA 
CAGGCATGAGCCACCGAGCCTGGCCAGTTTTCTGAGTTTTTATTTGAAATCAAAATAAGC 
[T,-] 

TTTTTTTTTTTTTTAATGGGCTTTAGAGTCCAGGGTAACGAACACTTTTTGGTGCCTATT 
ACTGAACCATTCAGGGTATTCCTGGGGTGGTGACCGTGTTCATTTCAGAAACCAACATGT 
TCATTTCAGAAACCAAACTCGGGTAACTTTTGATAAGTTCATCAACTAAGGCCCATGGCA 
GAATTTGAGGGCTAAGGGGTGTAATTAGTGTATGGGTAGAAATAAGTGCCTTCTTTCTAT 
AT TTTGGCGT T GT AG G AAT TT AAAGT GAT T CT G C AG T AAGT C T C AGG AG AC AAT T T T CT T 

232 6 GCTGATTGTGTTCTAGGGGACGGAGTAGGGGAAGACGTTTGCTCTCCCGGAACAGCCTAT 
CTCATTCCTTTCTTTCGATTACCCGTGGCGCGGAGAGTCAGGGCGGCGGCTGCGGCAGCA 
AGGGCGGCGGTGGCGGCGGCGGCAGCTGCAGTGACATGTCCAGCATGAATCCCGAATAGT 
GAGTTCAGGAGAGCACCGGTCGGCTGGGTCCGTGGGCCAGCTTGGGGGATCTTAAAGGGG 
TCGAGGAGGGTTGGGGCAGAAGTCGGGGCATCGGCTGGGGTGAGGCGAGGGTGATGGGTC 
[A,G] 

GGAGAGGCTGGCGGCCGGGAGTCGGGCCCCATTGTCTGACGCGGAGGGGCGGCCGCGCGG 
GGGAGGGGTCGGGCCGGAGGGGTGAGCCGCCCGGGCCTGGACCGGGTCAGGTTAGAGGGC 
CTGACTGCGGGGCGGGTGCTGAGGAAGCCTGCCGAGGGGCCTGGGGCGGTGTGAAGGGGT 
ATCTTCTCTCGGAGGCAGTGACTTTTGAAGGAGGACTTGTCTCTAAGGGGAGGGGATGGG 
GTGGGAGAGCCCTTCTAGAGGGCACTGTCAGACCCTGCGCCCGCACTCTGCGGAGCTGTC 

3486 CTGGGAACTGGTGTTCACTTCCCTTGGGTAGAGTTTGTTGGGCTCTCCTCAATGGCCCTT 
TAAAAATTTCCT CTACAGTTTACATGCATGT AAAGT AATGAATAATTGGAAGAGACCGAA 
TTGGTATTCCTTTTCAGTGTCAAAGGCCTTTGAGGGATGGGGGAAAATCAGTATTTGTTG 
TAAAAGTTGAGTTTATTTGCTGGTTTGGTCAATTACTGCTAGACATTTTCCCCTAAAAGG 
TCCACCCACCAGTTTAGCTGACTGTCATATGTGTGTCACATGGCTCTTGCAAAATGCTTA 
EC, A] 

AAGTTTTGTAATAGTGTGGCTTGAAGCTGAAATCTTTTGCACTAAACAGAAACCGTAGTA 
TTTTATTAGAATTTCATGCTTTAGAAGTTGAGGGTAGTGTTCTTGTAGTGACATTTGCTG 
TGTTGACAGTTTAAAAAAATTTTTTTTTCAAGGGCTCCAAGGACAAAGTTGGTTTTGCAC 
AGTTGAACGGAGGTGAACTTGAGGTTCTTAATTTAGTAGTTTTCTTGGTAACAATAAAGA 
ACATGGATTTACTGCTTTATCGAGGTTTATAGACCTCTACTGTTCAGGAAATTTTCTGAA 
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6651 TTTCAGCACATTAAGAAATGCTTAACATGGCCAGGCGCAGTGGCTCACGCCTGTAATTCT 
CAGCACTTTGGGAGGCCGAGGTGGGCGGATCATTTGAGGTCATGACCAGCCTGGCCAACA 
TGATGAGACACTGCCTCTACTAAAAATACAAAAATTAGCTGGGTGTGGTGGTGCACGCCT 
G T AAT T C C AG C T AC T C AG G AAC C T G AGG C AG G AG AG T C AC TT G AAC C T G G G AGG C GG AGG 
CTGCAGTGAGTCCAGATCATGCCACTGCACTCCAGCCTGAGGGACAGAGTGAGACTCCTC 
t-,A] 

AAAAAAAAAAAAAAAAAAG AAAG AAAT AC T T AACAT T ATT CT CGT GAT TAT T CT CAT AAC 
ATTTTTCATAATCCACTGGCTTCCAGTGGATTTTTTTAGTGTCAAGAAAATAATTTTGAT 
TGGTTCATCTTTAAGGAATGTGTTAAGAATAAAGCATGTCTACCTGTCTTCAGTATACCA 
GCTAACTATAGTAGGAAGAAATATAGTAGTCTACTTAGATCAACTATAATTCTTTAATGC 
AGAAAAAGTTTAAAGTATTTACCTTATTTTTAGCCCCCATCCCCTTAAGTATATCATGGC 

8190 AGACCGGCCTGGCCAATGTGGTGAAACCCTGCCTCTACTAAAAACACCAAATTAGCTAGG 
CGTGGTGGTGTGCGCTTGTAGTCCCAAGCTACTGAGGAGGCTGAGACAAGAGAATCGCTT 
GAATCTGGGAAAAAGAGGTTGCCGTGAGCCAAGATTGGCCACTGCACTCCAGCCTGGGTG 
ACAGAGTGAGATTCTGTCTCAAAAAAATAAAAAATAAAAATTTCCCCCTTTAATCAAATT 
AAGT T AAAATGAGGGATGTTAGACAGTTTTT AAC CATC AAATATTTTAGTTTAGTTTTTT 
[T,-] 

TTTTTAACGTTGTCTTAAAGATGGAAGTGCTTCAAAATCAAATCTTCCTTGCCAGTTCTC 
TACTTGGCTTCTTTTTTTTTCTTTTTGAGATAGAGTCTCACTTTGTCACTGGAGTGCGTT 
GGCGTGATCTCGGCTCACTGCAACCTCCGCCTTCCAGGTTTAAGTGATTCTTCCACCTCA 
GCCTCTCAAGTAGCTGGGAGTACAGGTGTGTGCCACCACACCCGGCTAATTTTTGTAGTT 
TTAGTAGAGACAGGGTTTCACTATGTTGGCCAGGCTGGCCTCAAACTCCTGACCTCGTGA 

8281 CTGAGGAGGCTGAGACAAGAGAATCGCTTGAATCTGGGAAAAAGAGGTTGCCGTGAGCCA 
AGATTGGCCACTGCACTCCAGCCTGGGTGACAGAGTGAGATTCTGTCTCAAAAAAATAAA 
AAAT AAAAATTTCCCCCTTTAATCAAATT AAGT T AAAATGAGGGATGTTAGACAGTTTTT 
AACCATCAAATATTTTAGTTTAGTTTTTTTTTTTTAACGTTGTCTTAAAGATGGAAGTGC 
TTCAAAATCAAATCTTCCTTGCCAGTTCTCTACTTGGCTTCTTTTTTTTTCTTTTTGAGA 
[T,C] 

AGAGTCTCACTTTGTCACTGGAGTGCGTTGGCGTGATCTCGGCTCACTGCAACCTCCGCC 
TTCCAGGTTTAAGTGATTCTTCCACCTCAGCCTCTCAAGTAGCTGGGAGTACAGGTGTGT 
GCCACCACACCCGGCTAATTTTTGTAGTTTTAGTAGAGACAGGGTTTCACTATGTTGGCC 
AGGCTGGCCTCAAACTCCTGACCTCGTGATCCACCCACCTCAGCCAAATTGCTGGGATTA 
CTTGTGTGAGCCACGCGCCTGGCTTCTACTTGGCTTTTAAAGGGAATTTTGCTTTCTGAG 

11546 GTTACATTTAACCCATTTATGGTCGTGTAGCCATACTCACGTTACATTTGATGCATCTGC 
TCCCTTTGTGTCTATATACTCATATAACATTTTGCATAAAGTTATAGGCAGTTCACACCA 
AGGCTGTTCATGAACCTCAGATTAAGAATACTTGATTTAGGAGATTGAAAACAGAAAAGA 
GAAT GT T AACTAT CAT TAT C AAT AT T AAAAT GT GAAAAT CT GAGAGT GACAAAGCT T AGC 
TTTAAATCTGGTATCCCAAACTCATTTGAGTTTTTTTTTTTTTTTTTTTTTTTTTGAGAC 
[A, G] 

AGGTGTCGCTTTGTCCCCCAGGCTGGAGTGTAGTGGTGTGATCTTGGCTCACTGCAACCT 
CCACCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCTGAAGTTGCTGGGATTACAGG 
CTGCGCCACCACGCCCAGCTAATTTTTTGTATTTATAGTAAAGACGGAGTTTCACCTTAT 
TGGCCAGGCTGGTCTCAAACTCCTGATCTTGTGATCCTCCCGCCTCGGCCTCCCAAAGTG 
CTGGGATTACAGGTGTGAGCCACTGTTCCCGGCCTAATTTGAGTTTTAAAATGTGGAGTT 

11670 T GT T CAT G AAC CT C AG AT T AAGAAT ACT T GAT T T AGG AGAT T GAAAAC AG AAAAG AG AAT 

GTTAACTATCATTATCAATATTAAAATGTGAAAATCTGAGAGTGACAAAGCTTAGCTTTA 
AATCTGGTATCCCAAACTCATTTGAGTTTTTTTTTTTTTTTTTTTTTTTTTGAGACAAGG 
TGTCGCTTTGTCCCCCAGGCTGGAGTGTAGTGGTGTGATCTTGGCTCACTGCAACCTCCA 
CCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCTGAAGTTGCTGGGATTACAGGCTG 
[C,T] 

GCCACCACGCCCAGCTAATTTTTTGTATTTATAGTAAAGACGGAGTTTCACCTTATTGGC 
CAGGCTGGTCTCAAACTCCTGATCTTGTGATCCTCCCGCCTCGGCCTCCCAAAGTGCTGG 
GATTACAGGTGTGAGCCACTGTTCCCGGCCTAATTTGAGTTTTAAAATGTGGAGTTTAAG 
ATGTTAGTCTTAAAGTGGGTTAGATGAAATTTATAAAAATAGTCAAATAGCTAAATTTAT 
AAAAGGCCATTTGAAACAATTTTGTGAAATATATAATGTGGATAATTATGTAGTGCTTTA 

11688 T AAGAAT ACT TGATTTAGGAG ATT GAAAACAGAAAAGAGAATGTTAACTATCATT AT CAA 

TATTAAAATGTGAAAATCTGAGAGTGACAAAGCTTAGCTTTAAATCTGGTATCCCAAACT 
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CATTTGAGTTTTTTTTTTTTTTTTTTTTTTTTTGAGACAAGGTGTCGCTTTGTCCCCCAG 
GCTGGAGTGTAGTGGTGTGATCTTGGCTCACTGCAACCTCCACCTCCCAGGTTCAAGTGA 
TTCTCCTGCCTCAGCCTCTGAAGTTGCTGGGATTACAGGCTGCGCCACCACGCCCAGCTA 
[A,G] 

TTTTTTGTATTTATAGTAAAGACGGAGTTTCACCTTATTGGCCAGGCTGGTCTCAAACTC 
CTGATCTTGTGATCCTCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCA 
CTGTTCCCGGCCTAATTTGAGTTTTAAAATGTGGAGTTTAAGATGTTAGTCTTAAAGTGG 
GTTAGATGAAATTTATAAAAATAGTCAAATAGCTAAATTTATAAAAGGCCATTTGAAACA 
ATTTTGTGAAATATATAATGTGGATAATTATGTAGTGCTTTATGTGTAGATTGGTGGTTA 

14 938 CATGGTAGTGTGCACCTGTAGTCCCAACCACTTGGGAGGCTGAGGTGGGAGGATTGCCTG 
AGGCCAGGAGTTTGAGACCTGGGCAGCATATGAAGACCCTGTCTCTAAAAAACTAAAAAT 
AAAAAATAGCCAGGTGTGGTTGGTGTGCTTGTGGTCCCAGCTACTCAAGAGGCTGAGGCA 
AGAGGGTTGCTTGAGCCCAGAAGTTGGAGGCTGCCGTGAACTGTGATTGCACCACTGCAC 
TTCAGCCTGGGTGACATAGCAAGACCCTGTCTCTGTGGTGGTGGTGGGTGGGGGTGGGGG 
[A,C] 

AGGGATTTAAGAAGGGTTTGTGAGGTATGTATTATTTATAAATGGGCTTTTAACTTTACC 
CTTCACATCTTGGGTTGAAATTAATTGTATCCATTCTCAGTTTTTCTGTCTTGCTATATA 
TTTAAACTTGGAGACTTAGAGGTCATGGATGTCTTTCTATGi\AAAGC7\AATGAAGCAGAG 
GGCTGCCTTCTCTTGCTGTAGAGGGCACACTTGCTGCAGAGCATGTTACTGTTTTATGCA 
TTGCTAGGCTTTGGGAGTTGTGACTTGTATGATCATAGTACTTACAACTATTAGTTGGCA 

22261 CACCCACAGATAGCTATGTCAAACGTAAGGGTGGAGAAACACAGACCCCAAACTTCTCGA 
GGGTAGAAAATATGAGGTTATAGTAGATTAGAACTACAAAAAGCTAGAGGAAGTTCTGAA 
CTGGAAACAGTGGATAGGATTTACTAGAATAATTTACGAGGGTGACAATTGTAAATCTTC 
ATAGGTTTCTTTTTTTTCCTTTCTCTTTTTTTTTTTTTGAGATGGAGTCTCGCTCTGTTG 
CCCAGGCTGGAGTGCAATGGCGCAGTCTCTCCTCACTGCAACCTCCGCCTCCTGGGTCCA 
[G,A] 

GTGATTCTCCTGCCTTAGCCACCCAAGTAGCTGGGATTACAGGCATCTGCCACCATGCTG 
AGCTAATTTTTGTATTTTTTTTTTTAGTAGAGACGGGGTTTCACCATGTTGGTCAGGCTG 
GTCTTGAACTCCTGACCTCAGGTAATCCACCCACCTTGGCCTCCCAAAGTGCTGGGATTA 
CAGGTGTGAGCCACCGCGCCCAGCCAAATTTTTATTGGTTTCTAAACTAGCGTAATTTAG 
TTTTTTTCACTTAAGTCAAAATTATATTATTGTAGGATAAAAACTTAGTGATCCAAATTC 

22852 ATCCAAATTCATGAGGAATGAAGAATAAATACATTTAAAGTCTTACCATTTGCTAAATTA 
GTCTTGGCTCTTTGTACCAAAATTCTGTCCTTGTGCTCTGTAATTTTATATTTGTATATT 
TTCTATCAACATTTTTACTGTGTGGTGTTTTGTAAATTATAAAAACGTTTTAAAGCAAAC 
TCAGAACAATGAATTCTCACGAATATTCAGTATATTTACAGTTGAGAAATAAACTACTTC 
TGTAGTAGGTAATTTAAAATGTCCCAATGCAAGTTAACGTGTCACTGATCACGCTATTCA 
[G,A] 

GTGTGTGTCTTTGATAAGGGGAGGTGGGGAAGTTTGTGGGTTTGATTTTATTTGCCTTTC 
TCATGTGACTGTTGTCATGTTAGTAAACAAATGGTTTGCGAGAGAACCAGTAGTCTTTTG 
CAAAGATTGTCTTATACAGAGCACTCAATTCTTCATATTATTTATAATGGCTTTAATTTA 
AGCCTTAAATTATTAGAAACTCATAAATAATTTTTTTATTTGTTTTTTTGAGATGGAGTT 
TCGCCCTTATTGTCCAGGCTGAAGTACAATGATGTGATCTTGACTCACTGCAACCTCCGC 

27253 GCTTAAGCCATGCATGGGCTTTATAGGAGATGTAGTCTTCACAGTGAGTTGTTATTTGTA 
GCTGTGTTTTTGTTTTTGTATAGCTTATAGCAATGCAGTGTGCTTTTTATTAACATCATT 
TTCTTTTTCTTTTTGCAGTGATTATTTATTCAAGTTACTTCTGATTGGCGACTCAGGGGT 
TGGAAAGTCTTGCCTTCTTCTTAGGTTTGCAGTAAGTTGAAATTGAAATGTCTTTACAAT 
TAATGGTACAATTAATGCTATGTATGTTTTCTAGGTAGATAAAATTAAACAGTTTTATTC 
[A,C] 

GAATAAGTTAATTCTTCCAGAATTTATATATTTAAAGACTCCAAATATACATCCCCAGTG 
GTATCTTGGACTGTTAAATAGAAAAATATTGTTGCTCTTAAAAGAAATTCAGTGAAGTCT 
GGTTATAAAGTCAGAATGTCTAATACTTTTGGTCAGAGTCAAACAGCAGTTCCAATATAG 
GCAGCAAGTTAAAGGGGTAGTTGGTGGCCTGTGTTGAAAGCGACTTGATGAAAATAAATC 
TTTAAATTAAACTTTAGTAGAATAAAAAGAAAAAGCAGAGCCAGGTGACGCAGTGGATCA 

28098 CTTTAAATTTAGCATGTTTCCTGGCCAGGTGCGGTGGCTCACGCCTGTAATCCCAGCACT 
TTGGGAGGCCGAGACGGGCGGATCACAAGGTCAAGAGATTGAGACCATCCTGGCTAACAC 
GGTGAAACCCCGTCTCTACTAAAAATACAAAAAATCAGCTGGGTGTGGTGCCACACGCCT 
GTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCGGAGG 
TTGCAGTGAGCTGAGATGGTGCCACTGCACTCCAGCCTGGCAACAGAGCAAGACTGTCTC 
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E-,A] 

AAAAAAAAAGAZVAAAAAATAAAAAMCAAATTAGCATGTTTCCCTTCTAGAGATCATTGT 
TTCTCAGAGCATGGACCAAAGACTCCTGGGGGTTACCAAGACCCTCTCAGGTAGCCCATG 
AGGT CAAAAT AT C CT AAT AAT ACT AAG AT GTT AGT ATT T GT AAG GAAAT AT T TACT TGGT 
AATAATACTAATATAAAAGATGTTTGCGTTTTTCAGTGATGACATTGGCTCTGGTACAAA 
AGCATGTGGGTAAAATTGCTGCTGGCTTGGTACACATCAAGGCAGCGCTAAGCTCCAAAT 

28597 GATGTTTGCGTTTTTCAGTGATGACATTGGCTCTGGTACAAAAGCATGTGGGTAAAATTG 
CTGCTGGCTTGGTACACATCAAGGCAGCGCTAAGCTCCAAATTGTACTCATGGTGATGGC 
ATTCTTTACCTCTGTGCCCTCACAGGAACAAAAACAAGCCGTGCCATTTTTATTGAAGAT 
TGTCCTTGACAAAACAGTTAAAATGATTAATTTTTGAAAAATGTTGATCCATGAGTATTC 
CTTTAAAAATATTTGTGAAGAAATGGGAAGTTCACATAAAACAATGTTTTTTTTTTGTTT 
[G,T] 

TTTTTTTTTTTTTTTTTGAGACAGATTCTGGCTGTGTTGCCAAGGCTAGAGTGCAGTGGC 
GTCTGGCTCCCAGGCTCAAGCTGTTCTCCCACTTCAGCCTCCCAAGTGGCTGGGACCTCC 
CAAGTGGATGCGCCATCATGCCTGGCTGATTTTTGTATTTTTTTGTAGTGACAAGGTCTC 
ACTGTGTTGCACAGGCTGGTCTCAAACTTCTGAGCTCAAGCGATGCATGTGCCTCAGCCT 
CCCAAAGTGCTGGAGAAAGCACTTTTTACTGCATACTGGCTAGTGTGTTGGTTATTTTGG 

31431 CTGCATTTTTTTTTTTTTTTTTGGTTTGAGATGGAGTCTCGCTCTGTCGCCCAGGCTGGA 
GTGCAGTCGTGCAATCTCGGCTCACTGCAGCCTCCACCTCATGGGTTCAAGCGATTCTCC 
ATCTTGGTCTCCTGACTAGCTAGGTTTACAGGCGTGTGCCATCACACCCACTAATTTTTT 
GTATTTTTAGTAGAGACAGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGATC 
TAAAGTGAGCCTCCCACCTTGGCCTCCCAAAGTGCTGGGATTACATATGTGAGCCACTGC 
[C,T,G] 

CCTGGCCTCTATATACTTCTATAGTACCTGATACTTATTAGGCACTCAATTACAACATAA 
CTTTTTTTTTTTTTTTTTTTTTTGAGACAGAGACATGCCTTGTCGCCTGGGCTGGAGTGC 
AGTGGCACAGTCTCGGCTCACTGCAACCTTCACCTCCCGGGTTCAAGTGATTCTCCTTCC 
TCAGCCTCCCGGGTAGCTGGGATTACAGGCGCCCGCCACCACGTCCAGCTAATTTTTTGT 
ATTTTTAATAGAGATGAGGTTTCACCATCTTGGCCAGGCTGATCTCAAACTCCTGACCTT 

357 04 ATGTGTGATCATTGGTGTTTATAAGATTTGGGTGTGTATTCGTGTGTGAAACATTCATAT 
TTTGTTACTTTCCTGTGGCTGGAAGGGATCTTATAGGACACTGTCTTTCATCTTTGTCTG 
TCTTTCATCTTTAATAGGAATTTCTTTTCCATGCCTGAAGGCCTCATTTTGAACATTTTG 
TTTGTTTGTTTTTTTATTTTTTGAGATACAGTATTGCTCTGTCTCCCAGGCTGGAGTGCA 
GTGGCGCGATTTGAGCTCACTGCAACCTCCGCCTCCTGGGTTCAAGTGATTCTCCTGCCT 
[C,T] 

AGCCTCCCTAATAGCTGGGATTACATGTGTGTACCACCATGCCCGGACAATTTTTTTTTT 
TTTGAGATGGAGCCTTGCTTTGTCGCCCAGGCTGGAGTGCCAGTGGTGCAATCTTGGCTC 
GCTGCAGCCTCCGCCTCCCAGGTTCAAGCAGTTCTCTTGCCTCAGCCTCCTGAGTAGCTG 
GGATTACAGGCGTGCGCCACCACACCCTGCTAATTTTTTGTATTTTTAGTAGAGACAGAG 
TTTCACCATGTTGGTTAGGCTGGTCTCGAACTCCTGACCTCGTGATCTGCCTGACTCGGC 

35728 GATTTGGGTGTGTATTCGTGTGTGAAACATTCATATTTTGTTACTTTCCTGTGGCTGGAA 
GGGATCTTATAGGACACTGTCTTTCATCTTTGTCTGTCTTTCATCTTTAATAGGAATTTC 
TTTTCCATGCCTGAAGGCCTCATTTTGAACATTTTGTTTGTTTGTTTTTTTATTTTTTGA 
GATACAGTATTGCTCTGTCTCCCAGGCTGGAGTGCAGTGGCGCGATTTGAGCTCACTGCA 
ACCTCCGCCTCCTGGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCTAATAGCTGGGATTA 
[C,T] 

ATGTGTGTACCACCATGCCCGGACAATTTTTTTTTTTTTGAGATGGAGCCTTGCTTTGTC 
GCCCAGGCTGGAGTGCCAGTGGTGCAATCTTGGCTCGCTGCAGCCTCCGCCTCCCAGGTT 
CAAGCAGTTCTCTTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCGTGCGCCACCACA 
CCCTGCTAATTTTTTGTATTTTTAGTAGAGACAGAGTTTCACCATGTTGGTTAGGCTGGT 
CTCGAACTCCTGACCTCGTGATCTGCCTGACTCGGCTTCCCAAAGTGCTGGGATTACAGG 

36690 AAAAAAAAAAAAAAAAGTAACCAGGTGTGGTGGTCCATGCCTGTAGTCCTAGCTCCCCAG 
GAGACTGAGGTGGGAGGAATGTTTGAGCCCAGGACTTCAAGGCTGCAGTGAGGCAAGATT 
GCACCATTGCACCCCAGCTTTGGGGACAGAGTGAGAGACCCTGTCTCAAAAACAAAATAA 
GGCTGGGCGCAGTGGCTGTCCGGGCGTCGTGGTTCACGCTTATAGTCCTAGCACTTTGGG 
AGGCCAAGGTGGGCAGATTGCCTGAGCTCAGGAGGTCTAAGACCAGCCTGAGCAACATGG 
[C,T] 

GAAAC CT C AT CTT T GC AAAACAT ACAGAAAAAAAC AAAAAAAAC CACAAAACC T CT AGTT 
GCCAGTTATTTTTTTTATTTATTCCTAGTGATTCTTCTTTTTTTCTTTTTTCTGAGACAA 
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AAATTTCACTTTGTCTCCCTCGCTAGAGTGCAGCGGTCAGCTCACTACATGATTCTTTTA 
GAGACATGTTAATTCTTTATATTGAGCTGAAGCCTGTTTCTTTTACTTCTGTCTCTTCTT 
ATTCCTCCGCCTTGTAGAGCTGCCTGAATCAGATTAATTCCTCTTTTATTGGCAAGCCTG 

41002 GAGTTGAGGACTAATGTTTCTATATCACATCCTGATAATCTCCACAGTTATGAAAACTAA 
ACTATTTCCCCTCCCTCCTACACTTTTCCCCAACTTTATTTTAATGGAATTGTTTGGATT 
TCTTGATTGTTTTGTAATAGTGGGACACAGCAGGCCAGGAAAGATTTCGAACAATCACCT 
CCAGTTATTACAGAGGAGCCCATGGCATCATAGTTGTGTATGATGTGACAGATCAGGTAA 
GTTCCAAGAGGAGATTGTGTTACAGTGACCAAGTAGGAAGCCATTATTTGATTAATGTCA 
[G,C] 

ATTCATTTACTACTTCATATATAAGCCATCAGTATTAATTTTATGGCAGAAAACTTTGTC 
CACT C T CAAATATAAAT GT GAAT C ACT TAAAAGAC AT TTGTTTTC CTGT AAT AAAT AAAA 
GATTAGTAATTAGTTTTACGTTTGCTTTCAAGGGATTCTGGTTGTATTTATTGTCAACTA 
AAT AA C T T T GAT C AAAT AG C C AAG AC T C T AAC AT AT AGG C AAG AG T T T G T AG GG AAT C G T 
GAGTTGCTTGGCTTATACTGTGTTCTTGGTGTTAAGTATTAACAGGAATATGGCCTGGTA 

41033 CTGATAATCTCCACAGTTATGAAAACTAAACTATTTCCCCTCCCTCCTACACTTTTCCCC 
AACTTTATTTTAATGGAATTGTTTGGATTTCTTGATTGTTTTGTAATAGTGGGACACAGC 
AGGCCAGGAAAGATTTCGAACAATCACCTCCAGTTATTACAGAGGAGCCCATGGCATCAT 
AGTTGTGTATGATGTGACAGATCAGGTAAGTTCCAAGAGGAGATTGTGTTACAGTGACCA 
AGTAGGAAGCCATTATTTGATTAATGTCAGATTCATTTACTACTTCATATATAAGCCATC 
[A,G] 

GTATTAATTTTATGGCAGAAAACTTTGTCCACTCTCAAATATAAATGTGAATCACTTAAA 
AGACATTTGTTTTCCTGTAATAAATAAAAGATTAGTAATTAGTTTTACGTTTGCTTTCAA 
GGGATTCTGGTTGTATTTATTGTCAACTAAATAACTTTGATCAAATAGCCAAGACTCTAA 
CATATAGGCAAGAGTTTGTAGGGAATCGTGAGTTGCTTGGCTTATACTGTGTTCTTGGTG 
TTAAGTATTAACAGGAATATGGCCTGGTAATTAGAACTTGTCCATCAGAATTGCCAAAAG 

4 3161 AGTCCTTCAATAATGTTAAACAGTGGCTGCAGGAAATAGATCGTTATGCCAGTGAAAATG 
TCAACAAATTGTTGGTAGGGAACAAATGTGATCTGACCACAAAGAAAGTAGTAGACTACA 
CAACAGCGAAGGTATGTTTAAAGTTTAATTTTCATACTGAATTTGAAGGTGTTGAATTAT 
GTATGGGTTCTGCAGTAACAGTAAGGCCACAGCCTTTTAAAAATATGTGCACTAGAATAC 
TGTGACAGTGACAATTTGTGTAGCATCTGTTTGGATCCAATGAACTTAGTTCCTCACGCT 
[C,T] 

CAT TAT G GAT G G TAG AAAT G C AG T AAG AAT T AGT G AAAAAG AT T T T T C AG T G T T AAT T G T 
GCCTCATTATTCTCTTAGGAATTTGCTGATTCCCTTGGAATTCCGTTTTTGGAAACCAGT 
G CT AAG AAT G C AACG AAT G T AG AAC AG T C T T T C AT G ACG AT G G C AG C T GAG AT T AAAAAG 
CGAATGGGTCCCGGAGCAACAGCTGGTGGTGCTGAGAAGTCCAATGTTAAAATTCAGAGC 
ACTCCAGTCAAGCAGTCAGGTGGAGGTTGCTGCTAAAATTTGCCTCCATCCTTTTCTCAC 

4 37 65 AAT GAAT TTGCAATCTGAACCCAAGTGAAAAAACAAAATTGCCT GAAT TGT ACT GTATGT 

AGCTGCACTACAACAGATTCTTACCGTCTCCACAAAGGTCAGAGATTGTAAATGGTCAAT 
ACTGACTTTTTTTTTATTCCCTTGACTCAAGACAGCTAACTTCATTTTCAGAACTGTTTT 
AAACCTTTGTGTGCTGGTTTATAAAATAATGTGTGTAATCCTTGTTGCTTTCCTGATACC 

AGACTGTTTCCCGTGGTTGGTTAGAATATATTTTGTTTTGATGTTTATATTGGCATGTTT 
[A,G] 

GATGTCAGGTTTAGTCTTCTGAAGATGAAGTTCAGCCATTTTGTATCAAACAGCACAAGC 
AGTGTCTGTCACTTTCCATGCATAAAGTTTAGTGAGATGTTATATGTAAGATCTGATTTG 
CTAGTTCTTCCTTGTAGAGTTATAAATGGAAAGATTACACTATCTGATTAATAGTTTCTT 
CATACTCTGCATATAATTTGTGGCTGCAGAATATTGTAATTTGTTGCACACTATGTAACA 
AAACAACTGAAGATATGTTTAATAAATATTGTACTTATTGGAAGTAATATCAAACTGTAT 

4 4713 AAGCAGCACCTTTCCTAATTGGCAAATGATCAGACTAATGTGTGCTAATGTTTTTCTTCC 
ATGCTTTCAGTCAGATTCAACTATTTTATCCTCCACAGTTGCTTAACTTGGTGTTGGAGG 
AGGG T T T AAG CAT T AAG AT AGGAAG C AGG AAAT T T GAT T G CT C T AAAT T TAG AAAT TAT A 
TCCCTAAAAATTAAAACATGAATACTGGGTGGTAATGATAATTGAGGCAAATGTATTTAT 
TTTGGTGACATTTTGCATATATGAAGATTTTCTGAAATAGGACCTTCAAGATCCTAGGGG 
EG,T] 

TTTTGTTTGGTTTTTAATTGTGAGGAATAAAAAATCTTCTGCCCACACTGGCATTTTAAG 
GTGACTGAGGTCAAACGTTGTTTCCTTAGGTTGAAATAGCAGCCAAAACATTCTTCACGC 
AGGGGCTTGGGATATGGCTGCTGGCAACACATTTTGTTGTGGGCTCCTTAATTTAATGAT 
AAAATTTAAGCTAAACACAAGCCAAAAATGAATAGGTTTTTTTAATTTTTATTTTTCACT 
AAACAGGCAATTGAAATACATGGTACAAAAATAAGTGGTAAGATAATTGTAAAATGAAAT 
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44831 GGAGGGTTTAAGCATTAAGATAGGAAGCAGGAAATTTGATTGCTCTAAATTTAGAAATTA 
TATCCCTAAAAATTAAAACATGAATACTGGGTGGTAATGATAATTGAGGCAAATGTATTT 
ATTTTGGTGACATTTTGCATATATGAAGATTTTCTGAAATAGGACCTTCAAGATCCTAGG 
GGGTTTTGTTTGGTTTTTAATTGTGAGGAATAAAAAATCTTCTGCCCACACTGGCATTTT 
AAGGTGACTGAGGTCAAACGTTGTTTCCTTAGGTTGAAATAGCAGCCAAAACATTCTTCA 
[C,T] 

GCAGGGGCTTGGGATATGGCTGCTGGCAACACATTTTGTTGTGGGCTCCTTAATTTAATG 
ATAAAATTTAAGCTAAACACAAGCCAAAAATGAATAGGTTTTTTTAATTTTTATTTTTCA 
CTAAACAGGCAATTGAAATACATGGTACAAAAATAAGTGGTAAGATAATTGTAAAATGAA 
AT GGAC AGAAT AT T CAAT T TT C CAT CT AT G AAAAT T T C ACAAT AAAAAT C AT AGT T TACT 
TTGTATTATAGGCGTGCTTGGTGGATCTATTCATCCTCACATAAGGCAACTGACAAATTC 
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